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Preface 


This text is a basic introduction to those areas of discrete mathematics of interest to 
students of mathematics. Introductory courses on this material are now standard at 
many colleges and universities. Usually these courses are of one semester’s duration, 
and usually they are offered at the sophomore level. 

Very often this will be the first course where the students see several real proofs. 
The preparation of the students is very mixed, and one cannot assume a strong back¬ 
ground. In particular, the instructor should not assume that the students have seen a 
linear algebra course, or any introduction to number systems that goes beyond col¬ 
lege algebra. 

In view of this, I have tried to avoid too much sophistication, while still retain¬ 
ing rigor. I hope I have included enough problems so that the student can reinforce 
the concepts. Most of the problems are quite easy, with just a few difficult exercises 
scattered through the text. If the class is weak, a small number of sections will be too 
hard, while the instructor who has a strong class will need to include some supple¬ 
mentary material. I think this is preferable to a book at a higher mathematical level, 
one that scares away the weaker students. 

Readership 

While the book is primarily directed at mathematics majors and minors, this material 
is also studied by computer scientists. The face of computer science is changing due 
to the influence of the internet, and many universities will also require a second 
course, with more specialized material, but those students also need the basics. 

Another developing area is the course on mathematical applications in the mod¬ 
em world, aimed at liberal arts majors and others. Much of the material in those 
courses is discrete. I do not think this book should even be considered as a text for 
such a course, but it could be a useful reference, and those who end up teaching 
such a course will also find this text useful. Discrete mathematics is also an elective 
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topic for mathematically gifted students in high schools, and I consulted the Indiana 
suggested syllabus for such courses. 

Outline of Topics 

The first two chapters include a brief survey of number systems and elementary set 
theory. Included are discussions of scientific notation and the representation of num¬ 
bers in computers; topics that were included at the suggestion of computer science 
instructors. Mathematical induction is treated at this point although the instructor 
could defer this until later. (There are a few references to induction later in the text, 
but the student can omit these in a first reading.) 

I introduce logic along with set theory. This leads naturally into an introduction 
to Boolean algebra, which brings out the commonality of logic and set theory. The 
latter part of Chapter 3 explains the application of Boolean algebra to circuit theory. 

I follow this with a short chapter on relations and functions. The study of rela¬ 
tions is an offshoot of set theory, and also lays the foundation for the study of graph 
theory later. Functions are mentioned only briefly. The student will see them treated 
extensively in calculus courses, but in discrete mathematics we mostly need basic 
definitions. 

Enumeration, or theoretical counting, is central to discrete mathematics. In Chap¬ 
ter 5 I present the main results on selections and arrangements, and also cover the 
binomial theorem and derangements. Some of the harder problems here are rather 
challenging, but I have omitted most of the more sophisticated results. 

Counting leads naturally to probability theory. I have included the main ideas 
of discrete probability, up to Bayes’ theorem. There was a conscious decision not 
to include any real discussion of measures of central tendency (means, medians) or 
spread (variance, quartiles) because most students will encounter them elsewhere, 
e.g., in statistics courses. 

Graph theory is studied, including Euler and Hamilton cycles and trees. This is 
a vehicle for some (easy) proofs, as well as being an important example of a data 
structure. 

Matrices and vectors are defined and discussed briefly. This is not the place for 
algebraic studies, but matrices are useful for studying other discrete objects, and this 
is illustrated by a section on adjacency matrices of relations and graphs. A number of 
students will never study linear algebra, and this chapter will provide some founda¬ 
tion for the use of matrices in programming, mathematical modeling, and statistics. 
Those who have already seen vectors and matrices can skip most of this chapter, but 
should read the section on adjacency matrices. 

Chapter 9 is an introduction to cryptography, including the RSA cryptosystem, 
together with the necessary elementary number theory (such as modular arithmetic 
and the Euclidean algorithm). Cryptography is an important application area and is 
a good place to show students that discrete mathematics has real-world applications. 
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Moreover, most computer science majors will later be presented with electives in this 
area. The level of mathematical sophistication is higher in parts of this chapter than 
in most of the book. 

The final chapter is about voting systems. This topic has not been included in very 
many discrete mathematics texts. However, voting methods are covered in many of 
the elementary applied mathematics courses for liberal arts majors, and they make a 
nice optional topic for mathematics and computer science majors. 

Perhaps I should explain the omissions rather than the inclusions. I thought the 
study of predicates and quantifiers belonged in a course on logic rather than here. 
I also thought lattice theory was too deep, although it would fit nicely after the section 
on Boolean forms. 

There is no section on recursion and recurrence relations. Again, this is a deep 
area. I have actually given some problems on recurrences in the induction section, but 
I thought that a serious study belongs in a combinatorics course. Similarly, the deeper 
enumeration results, such as counting partitions, belong in higher-level courses. 

Another area is linear programming. This was once an important part of dis¬ 
crete mathematics courses. But, in recent years, syllabi have changed. Nowadays, 
somewhat weaker students are using linear programming, and there are user-friendly 
computer packages available. I do not think that it will be on the syllabus of many of 
the courses at which this book is aimed. 

Problems and Exercises 

The book contains a large selection of exercises, collected at the end of sections. 
There should be enough for students to practice the concepts involved. Most are 
straightforward; in some sections there are one or two more sophisticated questions 
at the end. 

A number of worked examples, called Sample Problems, are included in the 
body of each section. Most of these are accompanied by a Practice Exercise, de¬ 
signed primarily to test the reader’s comprehension of the ideas being discussed. It 
is recommended that students work all the Practice Exercises. Complete solutions 
are provided for all of them, as well as brief answers to the odd-numbered problems 
from the sectional exercise sets. 

Gender 

In many places a mathematical discussion involves a protagonist—a person who flips 
a coin or deals a card or traverses a road network. These people used to be exclusively 
male in older textbooks. In recent years this has rightly been seen to be inappropriate. 
Unfortunately this has led to frequent repetitions of nouns—“the player’s card” rather 
than “his card”—and the use of the ugly “he or she.” 
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I decided to avoid such problems by a method that was highly appropriate to this 
text: I flipped a coin to decide whether a character was male or female. If the reader 
detects an imbalance, please blame the coin. 

There were two exceptions to this rule. Cryptographers traditionally write about 
messages sent from Alice (A) to Bob (B), so I followed this rule in discussing RSA 
cryptography. And in the discussion of the Monty Hall problem, the game show host 
is male, in honor of Monty, and the player is female for balance. 

Acknowledgments 

My treatment of discrete mathematics owes a great deal to many colleagues and 
mathematicians in other institutions with whom I have taught or discussed this ma¬ 
terial. Among my influences are Roger Eggleton, Ralph Grimaldi, Dawit Haile, Fred 
Hoffman, Bob McGlynn, Nick Phillips, Bill Sticka, and Anne Street, although some 
of them may not remember why their names are here. 

I am grateful for the constant support and encouragement of the staff at 
Birkhauser. 

Carbondale, USA W.D. Wallis 


Contents 


1 Properties of Numbers. 1 

1.1 Numbers. 1 

1.2 Sums. 9 

1.3 Bases. 15 

1.4 Scientific Notation. 20 

1.5 Arithmetic in Computers. 24 

2 Sets and Data Structures. 31 

2.1 Propositions and Logic. 31 

2.2 Elements of Set Theory . 38 

2.3 Proof Methods in Set Theory. 45 

2.4 Some Further Set Operations. 52 

2.5 Mathematical Induction. 57 

3 Boolean Algebras and Circuits . 67 

3.1 Boolean Algebra. 67 

3.2 Boolean Forms. 73 

3.3 Finding Minimal Disjunctive Forms. 79 

3.4 Digital Circuits . 86 

4 Relations and Functions . 93 

4.1 Relations . 93 






















Contents 


xii 

4.2 Some Special Kinds of Relations. 100 

4.3 Functions. 105 

5 The Theory of Counting. 113 

5.1 Events. 113 

5.2 Unions of Events. 121 

5.3 One-to-One Correspondences and Infinite Sets. 129 

5.4 Arrangement Problems. 133 

5.5 Selections. 140 

5.6 The Binomial Theorem and Its Applications. 151 

5.7 Some Further Counting Results. 156 

6 Probability. 165 

6.1 Probability Measures . 166 

6.2 Repeated Experiments . 177 

6.3 Counting and Probability. 185 

6.4 Conditional Probabilities. 191 

6.5 Bayes’ Formula and Applications. 203 

7 Graph Theory. 215 

7.1 Introduction to Graphs. 215 

7.2 The Konigsberg Bridges; Travers ability. 222 

7.3 Walks, Paths, and Cycles. 232 

7.4 Distances and Shortest Paths. 236 

7.5 Trees. 243 

7.6 Hamiltonian Cycles. 252 

7.7 The Traveling Salesman Problem. 257 

8 Matrices. 263 

8.1 Vectors and Matrices. 263 

8.2 Properties of the Matrix Product. 271 

8.3 Systems of Linear Equations. 277 






























Contents xiii 

8.4 More About Linear Systems and Inverses . 284 

8.5 Adjacency Matrices. 291 

9 Number Theory and Cryptography. 297 

9.1 Some Elementary Number Theory. 297 

9.2 Modular Arithmetic. 303 

9.3 An Introduction to Cryptography . 310 

9.4 Substitution Ciphers. 317 

9.5 Modern Cryptography. 323 

9.6 Other Cryptographic Ideas. 329 

9.7 Attacks on the RSA System. 333 

10 The Theory of Voting. 339 

10.1 Simple Elections. 339 

10.2 Multiple Elections. 350 

10.3 Fair Elections. 359 

10.4 Properties of Electoral Systems. 369 

Solutions to Practice Exercises. 379 

Answers to Selected Exercises. 409 


Index 


423 




















1 


Properties of Numbers 


When we study numbers, many of the problems involve continuous properties. Much 
of the earliest serious study of mathematics was in geometry, and one essential prop¬ 
erty of the real world is that between any two points there is a line segment that is 
continuous and infinitely divisible. All of calculus depends on the continuous nature 
of the number line. Some of the most famous difficulties of Greek mathematics in¬ 
volved the existence of irrational numbers and the fact that between any two real 
numbers one can always find another number. 

But the discrete properties of numbers are also very important. The decimal nota¬ 
tion in which we usually write numbers depends on properties of the number 10, and 
we can also study the (essentially discrete) features of representations where other 
positive whole numbers take the role of 10. 

When we write the exact value of a number, or an approximation to its value, we 
use only the ten integers 0, 1,2, 3, 4, 5, 6, 7, 8, 9, together with the decimal point. 
When numbers are represented in a computer, only integers are used. So it is impor¬ 
tant to understand the discrete properties of numbers to talk about their continuous 
properties. 


1.1 Numbers 

Sets and Number Systems 

All of discrete mathematics—and, in fact, all of mathematics—rests on the founda¬ 
tions of set theory and numbers. In this first section we remind you of some basic 
definitions and notations. Further properties of numbers will be explored in the rest 
of this chapter; sets will be discussed further in Chapter 2. 

We use the word set in everyday language: a set of tires, a set of saucepans. In 
mathematics you have already encountered various sets of numbers. We shall use 
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1 Properties of Numbers 


set to mean any collection of objects, provided only that there is a well-defined rule, 
called the membership law , for determining whether a given object belongs to the 
set. The individual objects in the set are called its elements or members and are said 
to belong to the set. If S is a set and s is one of its elements, we denote this fact by 
writing 

S £ S , 

which is read as “5 belongs to S” or “s is an element of S' 

The notation S c T means that every member of S is a member of T : 

X £ S = ) > X £ T . 

Then S is called a subset of T . This definition allows for the possibility that S = T, 
and, in fact, a set is considered to be a subset of itself. If S c T, but S ^ T we say 
S' is a proper subset of T and write S C T. 

One way of defining a set is to list all the elements, usually between braces; 
thus the set of the first three members of the English alphabet is {a, b, c}. If S is the 
set consisting of the numbers 0, 1, and 3, we could write S = {0, 1, 3}. We write 
{1,2, ..., 16} to mean the set of all whole numbers from 1 to 16. This use of a string 
of dots is not precise, but is usually easy to understand. Another method is the use 
of the membership law of the set: for example, since the numbers 0, 1, and 3 are 
precisely the numbers that satisfy the equation x 3 — 4x 2 + 3x = 0, we could write 
the set S as 

S = {x : x 3 — 4x 2 + 3x = 0} or S = {x|x 3 — 4x 2 + 3x = 0} 

(“the set of all x such that x 3 — 4x 2 + 3x = 0”). Whole numbers are called integers , 
and integral means “being an integer,” so the set of whole numbers from 1 to 16 is 

{x : x integral, 1 < x < 16}. 

Sample Problem 1.1. Write three different expressions for the set with elements 
1 and —1. 

Solution. Three possibilities are {1, —1}, {x : x 2 = 1}, and “the set of square 
roots of 1.” There are others. 

Practice Exercise. Write three different expressions for the set with elements 1, 
2, and 3. 

Some sets of numbers are so important that special names are given to them. 
The set of all integers, or whole numbers, is denoted Z. With this notation, the set 
{1,2, ..., 16} can be written 

{x : x g Z, 1 < x < 16}. 
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The positive integers or natural numbers , usually denoted z+ or N, are the inte¬ 
gers greater than 0. Another important set of integers is the set Z° of nonnega¬ 
tive integers. We sometimes write Z + = {1, 2, 3, ...}, Z° = {0, 1, 2, 3, ...}, and 
Z = {..., —3, —2, —1, 0, 1, 2, 3, ...}. The use of a string of three dots without a 
terminating number (called an ellipsis) is understood to mean that the set continues 
without end. Such sets are called infinite (as opposed to finite sets like {0, 1, 3}). The 
number of elements in a finite set S is called the order of S and is denoted | S |; for 
example, if S = {0, 1, 3} then \S\ = 3. 

The rational numbers Q consist of all fractions whose denominator is not 0: 

Q = I — ! p £ Z, q £ Z, c[ ^ 0 . 


Equivalently, it can be shown that Q is the set of all numbers with a repeating or 
terminating decimal expansion. Examples are 


3 

- = 0.428571428571.... 

7 

In the last example, the digit string 428571 repeats forever, and we usually indicate 
this by writing 


3 

7 


0.428571. 


The denominator q cannot be zero. In fact, division by zero is never possible. This 
is not an arbitrary rule, but rather it follows from the definition of division. When we 
write i = ^, we mean 4 A is the number that, when multiplied by q, gives /?.” What 
would x = 2/0 mean? There is no number that, when multiplied by 0, gives 2. 
Similarly, x = 0/0 would be meaningless. In this case there are suitable numbers x, 
in fact, every number will give 0 when multiplied by 0, but we want a uniquely 
defined answer. 

Different decimal expansions do not always mean different numbers. The excep¬ 
tion is an infinite string of 9’s. These can be rounded up: 0.9 =1, 0.79 = 0.8, and 
so on. We prove a small theorem that illustrates this fact. 


Theorem 1. 0.9 = 1. 


Proof. Suppose v = 0.9. Then lOv = 9.9 = 9 + 0.9 = 9 + v. So 9x = 9 and x = 1. 

□ 

The integers are all rational numbers, and, in fact, they are the rational numbers 
with numerator 1. For example, 5 = 5/1. 
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1 Properties of Numbers 


Each rational number has as infinitely many representations as a ratio. For exam¬ 
ple, 


1/2 = 2/4 = 3/6 = • • •. 


The final number system we shall use is the set M of real numbers , consisting 
of all numbers that are decimal expansions. Not all real numbers are rational; one 
easy example is \fl. In fact, if n is any natural number other than a perfect square 
(one of 1,4, 9, 16, ...), then <Jn is not rational. Another important number that is 
not rational is the ratio tt of the circumference of a circle to its diameter. 

The number systems satisfy Z + C Z c Q c R. Rational numbers that are not 
integers are called proper fractions, and real numbers that are not rational are called 
irrational numbers. 

When a < b, the set of all real numbers x such that a < x < b is called an open 
interval and denoted (a,b)\ we write [a, b] for the set of all real numbers x with 
a < x < b] (a closed interval). Similarly, [a,b) = {x : x < x,b}, and ( a , b] 

is an interval that contains b but not a. 

The notations Z + and Z°, mentioned previously, can be extended to the other 
number systems: for example, M + is the set of positive real numbers. It is also useful 
to discuss number systems with the number 0 omitted from them, especially when 
division is involved. We denote this with an asterisk: for example, Z* is the set of 
nonzero integers. 


Factors and Divisors 

When v and y are integers, we use the phrase “x divides y” and write x\y to mean 
“there is an integer z such that y = xz,” and we say x is a divisor of y. Thus 2 divides 
6 (because 6 = 2 • 3), — 2 divides 6 (because 6 = (—2)- (—3), 2 divides —6 (because 
—6 = 2* (—3)). Some students get confused about the case y = 0, but according 
to our definition x divides 0 for any nonzero integer x . We define the factors of a 
positive integer x to be the positive divisors of x (negative divisors are not called 
factors so —2 is not a factor of 6). 

If x divides both y and z, we call x a common divisor of y and z. Among the 
common divisors of y and z there is naturally a greatest one, called (not surprisingly) 
the greatest common divisor of y and z, and denoted (y, z). If (y, z) = 1, y and z 
are called coprime or relatively prime. For example, (4, 10) = 2, so 4 and 10 are not 
coprime; (4, 9) = 1, so 4 and 9 are coprime. In the latter example we also say 4 is 
relatively prime to 9. 

A prime number is a positive integer x other than 1 whose only factors are 1 
and x. The first few primes are 2, 3, 5, 7, 11, and 13. The number 1 is excluded and 
is called a unit. 


Theorem 2. The set of all prime numbers is infinite. 
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Proof. Suppose the number of primes is finite. Then there will be some positive inte¬ 
ger n such that there exist exactly n primes. Suppose the primes are p \, p 2 ,..., p n - 
Now consider the number 

X = P\ X P2 x • • ■ X Pn + 1. 

Clearly, dividing x by p\ would leave a remainder of 1, and similarly for the other 
Pi\ so x is not divisible by any of p\, p 2 ,..., p n . Either it is prime, or its prime 
divisors are outside the set of all primes—but the latter case is impossible. So the 
assumption that the number of primes is finite must be false. □ 

Any positive integer x can be written as a product 

X = X\ X X2 x • • • X Xfc, 

where the x z - are all primes. So every positive integer is the product of prime factors. 
For example, 36 is the product 2 x 2 x 3 x 3. We usually collect all the equal factors 
and use an exponent, so we write 36 = 2 2 x 3 2 . This prime factor decomposition is 
unique: for example, if 

2 a 3 b 5 c = 2 x 3 y T, 


it must be true that a = x, b = y, and c and z are both zero. We shall look at this 
further in Section 9.1. 

Sample Problem 1.2. Use the prime factor decomposition to find the greatest 
common divisors of each of the following numbers with 224: 16, 53, 63, 84, 97. 

Solution. 224 = 2 5 • 7, and 16 = 2 4 , 53 is prime, 63 = 3 2 • 7, 84 = 2 2 • 3 • 7, 97 
is prime. So (16, 224) = 16, (53, 224) = 1, (63, 224) = 7, (84, 224) = 28, and 
(97,224) = 1. 

Practice Exercise. Use the prime factor decomposition to find (72, 84) and 
(56, 42). 


Exponents and Logarithms 

If x is a positive integer, b x is the product of x copies of b\ b x = b x b x • • • x b 
(x factors). In this expression b is called the base and x the exponent. It is easy to 
deduce such properties as 

b x b y =b x + y , 

(b x ) y =b xy , 

(ab) x =a x b x . 

Negative exponents are handled by defining b~ x = and also b° = 1 whenever 
b is nonzero. The multiplication rule leads us to define h* to be the xth root of b. 
(When x is even, we take the positive root for positive b and say b * is not defined 
for negative b.) 
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Sample Problem 1.3. Express the following in the simplest form—as decimal 
numbers if possible: 

(6i2)°, (* 3 ) 5 , (* 4 ) 0 , (—ior 4 , jE, 

Solution. (612)° = 1 (Z?° = 1 for any b ); (x 3 ) 5 = x 3 ' 5 = x 15 ; (x 4 ) 0 = 1 
(again, b° = 1 for any b , or you could also argue that (x 4 ) 0 = x 4 ' 0 = x° = 1 ); 
(-fO )- 4 = ^ ^ ^ = 0.0001; ^ = 10 2 = 100. 

Practice Exercise. Do the same to 

l' J . 0 5 . £ (- 2 )-'. 

Sample Problem 1.4. Express in the simplest possible form, with positive expo¬ 
nents: 

-E-, (3a 2 ) (5a- 3 ). 

Solution. 

-L = ( x -4)-l = x (-4)-(-l) = x 4. 

—2 3 

U o _1 _ 9 O V 

— = ( u 2 )(v 3 ) 1 = u 2 v 3 = - = ~; 

V 3 u z 

( 3a 2 )(5a ~ 3 ) = 3 • 5 • a 2 • a ~ 3 = 15 a~ l = —. 

a 

Practice Exercise. Express in the simplest possible form, with positive expo¬ 
nents: 

(4*- 2 )(3* 4 ). 

The process of taking powers can be inverted. The logarithm of x to base b , or 
log^ x, is defined to be the number y such that b y = x. Clearly log^ x is not defined 
if b is 1 (if x = 1 , any y would be suitable, and if x 7 ^ 1 , no y would work). 

Sample Problem 1.5. What are log 2 8 and log 16 4? 


Solution. 2 3 = 8 , so log 2 8 = 3. \/l 6 = I 62 = 4, so log 16 4 = 


Practice Exercise. What are log 3 9, log 5 125 and log 4 2? 

Several properties of logarithms follow from the elementary properties of expo¬ 
nents. In particular 
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log* MU = log* M + log* U, 
log* (m _ 1 ) = log* M, 
log*(x- v ) = vlog*.v. 
log^ 1=0 for any b. 

Theorem 3. For any x and any base b, x = log^ b x . 

Proof. By definition, u = b\og b u. Putting u = b x we have bx = b(log b b x ). So 
comparing the exponents, x = log^ b x . □ 

Sample Problem 1.6. Evaluate 125 log 5 2 . 

Solution. 

125 log 5 2 = ( 5 3 yog 5 2 
_ ^3log 5 2 
= 5log 5 2 3 
— 5l0g5 8 
= 8 . 

Practice Exercise. Evaluate log 3 (9V3) 5 . 


Absolute Value, Floor, and Ceiling 

The absolute value or modulus of the number x, which is written |jc|, is the nonneg¬ 
ative number equal to either x or —x. For example, |5.3| = 5.3, |—7.21 = 7.2. (Do 
not confuse this with the notation for set order.) 

The floor |_*J of x is the largest integer not greater than x. If x is an integer, 
|_xj = v. Some other examples are |_6.1J = 6, [—6.1} = —7. It is easy to deduce the 
following properties: 

(1) |_xj —n if and only if n is an integer and n < x < n + 1. 

(2) If v is nonnegative, then L*J equals the integer part of x. If x is negative and 
nonintegral, then L*J is one less than the integer part of x. 

(3) If n and k are integers, then k divides n if and only if | = |_^J. 

The ceiling [x~\ is defined analogously as the smallest integer not less than x. 
Sample Problem 1.7. What are |_6.3_|, T7.2], \-2A\ |3.4|, |-3.4|? 


Solution. 6, 8, —2, 3.4, 3.4. 
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Practice Exercise. What are |_2.6_|, |_— 1.7J, [4.8], f—4], |3.1|, |—4.4|? 


Exercises 1.1 


Are the given statements true or false in Exercises 1 to 10? 

1. 3e {2, 3,4,6}. 2. 4 £ {2, 3, 4, 6}. 

3. 5 e {2, 3,4,6}. 4. {3,2} = {2,3}. 


5. {1,2} e {1,2,3} 
7. 5 e {1,3,4, 7}. 

9. 4 e {1,3, 4, 7}. 


6 . {1,2} = {1,2,3}. 
8. 6 i {1,3, 4, 7}. 

10. {1,3} = {1,2,3}. 


In Exercises 11 to 15, write the list of all members of the set. 

11. {x : x is a month whose name starts with J}. 

12. {x : x is an odd integer between —6 and 6}. 

13. {x : x is a letter in the word “Mississippi”}. 

14. {x : x is an even positive integer less than 12}. 

15. {x : v is a color on the American flag}. 

16. Which of the following are true? 

(i) All natural numbers are integers. 

(ii) All integers are natural numbers. 

17. For each of the following numbers, to which of the sets Z + , Z, Q, M does it 
belong? 

(i) -2.13, (iii) Vi, (v) 1.834, 

(ii) V5, (iv) 2: r, (vi) 2 + V2. 

18. For each of the following numbers, to which of the sets Z + , Z, Q, R does it 
belong? 

(i) 1.308, (iii) -7, (v) 10,508, 

(ii) 1.3, (iv) V3, (vi) 1 - >/7. 


In Exercises 19 to 21, list all the factors of the number. 


19. 

36. 

20. 50. 

21. 

72. 

22. 

61. 

23. 63. 

24. 

24. 

25. 

29. 

26. 70. 

27. 

48. 
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In Exercises 28 to 36, decompose the two numbers into primes and then compute 
their greatest common divisor. 

28. 56 and 63. 29. 231 and 275. 30. 444 and 629. 

31. 95 and 125. 32. 462 and 252. 33. 88 and 132. 


34. 256 and 224. 


35. 1080 and 855. 36. 168 and 231. 


In Exercises 37 to 44, simplify the expression, writing the answer using positive ex¬ 
ponents only. 

37. t 3 t~ 3 . 38. (2x 2 y~ 3 ) 2 . 


39. &£. 

xy z 

41. (2 xy)~ 2 . 


40. 


5 6 2 4 

10 3 ' 


42. x 4 x~ 4 . 


43. 5yV 3 . 


44. (3x 2 ) 3 (2x)~ 4 . 


In Exercises 45 to 59, evaluate the expression. 


45. log 3 9. 

48. log 10 (0.1). 
51. r— 83.11. 
54. 111.4|, 

57. U-11.9IJ. 


46. log 2 
49. L28.4J. 
52. IT-7.411 
55. [77.7]. 
58. LV2J. 


47. log 25 5. 

50. r-107.7]. 
53. |L-11.9J|. 
56. n - 7.4|]. 
59. 11.731. 


1.2 Sums 

Sum Notation 

The sum of the first 16 positive integers can be written 

1+2 + 3 + 4 + 5 + 6 + 7 + 8 + 9 + 10+11 + 12+13 + 14+15 + 16, 

or more briefly 1 + 2 + • • • + 16. It should be clear that each number in the sum is 
obtained by adding 1 to the preceding number. A more precise notation is 

16 

i=i 


In the same way, 
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/(*') = /(D + /(2) + /(3) + /(4) + /(5) + /(6); 


1 = 1 


the notation means “first evaluate the expression after the (that is, /(/)) when 
i — 1, then when i = 2, , then when i = 6, and add the results.” 

Definition. A sequence (at) of length n is a set of n numbers {a\, < 22 , ..., ^}, or 
: 1 < i < n}. The set of numbers is ordered — a\ is first, a 2 is second, and so 
on—and at , where i is any one of the positive integers 1, 2, ..., n, is called the zth 
member of the sequence. 

Definition. If (fl/) is a sequence of length n or longer, then YH= 1 +' * s defined by 
the rules 



i=l 



Usually the sigma notation is used with a formula involving i for the term fol¬ 
lowing as in the following examples. Notice that the range need not start at 1; we 
can write YH=j when j and n are any integers, provided j <n. 

Sample Problem 1.8. Write the following as sums and evaluate them: 


4 6 


y>- + D. 


i =1 i =3 


Solution. 


4 


^ ( -2 = 1 2 +2 2 + 3 2 + 4 2 


i =1 


= 1+4 + 9 + 16 
= 30; 


6 


J]/(/ + l) = 3- 4 + 4- 5 + 5- 6 + 6- 7 


i=3 


= 12 + 20 + 30 + 42 
= 104. 


Practice Exercise. Write the following as sums and evaluate them: 
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5 6 

J2 i(i - 1): 53 *• 

i =3 i =2 

Sample Problem 1.9. Write the following in sigma notation: 

2 + 6+10+14; 1 + 16 + 81. 

Solution. Ya=\ {Ai ~ 2); ELi ( ' 4 - 

Practice Exercise. Write the following in sigma notation: 

1 + 3 + 5 + 7 + 9; 8 + 27 + 64+ 125. 

Some Properties of Sums 

It is easy to see that the following properties of sums are true. 

(1) If c is any given number, then Y^= 1 c = nc. 

(2) If c is any given number and (a{) is any sequence of length n , then 

(3) If (at) and (bi) are any two sequences, both of length n , then 

n / n \ / n \ 

r. = ( a i ) + ( y. bi i 

i =1 \i= 1 / \i= 1 / 

(4) If (at ) is any sequence of length n,l<j<n, then 

n j n 

y^ 

i—i i—i i—j+1 

The following is a standard result on sums. 

Theorem 4. TTze sum of the first n positive integers is \n(n + 1), or 

n(n + 1) 



X++) = c[Y. 


i=i 


Vi=l 


Proof. Let us write s for the answer. Then s = Y%= l l - We shall define two very 
simple sequences of length n. Write a\ = i and bi = n + 1 — i. Then (at) is 
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the sequence (1,2 ,,n) and (bfi is (n, n — 1, ..., 1). They both have the same 
elements, although they are written in a different order, so they have the same sum, 

n n 

s = j2°i = J2 bi - 

i =1 i =1 

So 

n n 

2.v = y] «,+ y] /?, 

;=i (=i 

n 

= fe + bi) 

i =1 
n 

= y. (/+(» + i- o) 

i=\ 

n 

= (« + 1) 

(=i 

= + 1 ). 

Therefore, dividing by 2, we get s = n(n + l)/2. □ 

Since adding 0 does not change any sum, this result could also be stated as 

n(n + 1) 


That form is sometimes more useful. 



Sample Problem 1.10. Find the sum of the numbers from 11 to 30 inclusive. 

Solution, Yll=\ l i is required. Now 

30 10 30 

i =1 i =1 i =11 

and, using the theorem and substituting, we have 


30-31 10-11 


30 




z=ll 


30 


465 = 55 + J2 


i =11 


so £-£ n / =465 -55 = 410. 
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Sample Problem 1.11. Find 2 + 6 + 10 + 14 + 18 + 22 + • • • + 122. 
Solution. This is the sum of term 2 + 4 i, where i goes from 0 to 30: 


30 / 30 \ / 30 \ 

£(2 + 40= £2 +(I> 

i= 0 \i=0 / \i=0 / 

(§") 


30-31 

= 31 • 2 + 4 • — 

= 62+ 1860 
= 1922. 

Practice Exercise. Find 3 + 7 + • • • + 43. 

Two other standard results that will be proved in the exercises are as follows. 

Theorem 5. Y%= l * 2 

Theorem 6. Y^i= l -x* -1 = unless x = 1. 

Alternatively, Theorem 6 says 


-2 _ h(«+1)(2«+1) 

6 


r n _ 1 

i=0 


or 


W r ^z+l 1 

W = f__ 

2—1 x — l 

i=l 

(unless v = 1, in both cases). 

Exercises 1.2 

In Exercises 1 to 12, write the expression as a sum and evaluate it. 

1 - Ef=i(i 2 + D- 2 . El 3 *V-3). 


3. E+V*. 
5. E/=1 * 3 - 


4. T,i=2 (2i - !)• 


6 - ^-1 + 
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7- Ef=3i(i-2). 

8. E-=2^-D 

9- E;=io ( ' + T- 

io. el < 2 - y 

11 - E?=2i + (-!)'■ 

12 . E/U (! + * 2 )- 

13 . (Proof of Theorem 5) 


(i) Say bi = i 3 — (/ — 

l) 3 . Prove that YTi= l bi = n?> - 


(ii) Prove that bi = 3/ 2 — 3/ + 1, and therefore, 

n n n 

bi = 3 i 2 — 3 z + n. 

i =1 i=l z=l 

(iii) Use this to prove that 

2 n(n + l)(2n + 1) 

“ 6 ' 

14 . (Proof of Theorem 6) Suppose a\ = 1, = x, a?, = x 2 , and, in general, 

Cli = x l ~ l . 

(i) Prove that YTi=2 a i = x Ya =i a i- 

(ii) Use this to show that Y^= l ^ = 1 + x Y^=i a i ~ xU • 

(iii) Use part (ii) to find the value of YH= l when x ^ 1. 

(iv) Why did we have to require “x / 1” in part (iii)? 

In Exercises 15 to 18, Y^= l a i = ^ Y^= l bi = Evaluate Y^i= l c z ^ 

case. 

15 . ci = 2a t + 1. 16 . Cj = 5fl f . 

17 . a = 3a/ - b t . 18 . a = a,- + 2fc,-. 

Use £/ze standard results of Theorems 4, 5, aad 6 aaJ the four properties of the sums 
to evaluate the expressions in Exercises 19 to 30. 


io. e‘= 7 a - 3). 

20. Els (5* 2 - 4). 

21- Et 2 2 ! '- 

22. E"=i 3 ‘- 

23. E^s (2* + 1). 

24. E'E O ' 2 + 1) 

25. E?=i(i 2 + D. 

26. E"=i (* 2 - !)■ 

27. ELi2- ; . 

28. E"=i (2* 2 - i). 

29. EE, (1 + if- 

30. EEi a -1) 2 





1.3 Bases 


15 


1.3 Bases 

Arithmetic in Various Bases 

In ordinary arithmetic we use ten digits or one-symbol numbers {0, 1, 2, 3, 4, 5, 6, 7, 
8, 9} to write all the possible numbers. The symbol for “ten” is 10, meaning “once 
ten plus zero times one.” For example, 243 means “twice ten-squared plus four times 
ten plus three.” In general, suppose ao, a\ , a 2 , ..., and b\ , Z? 2 , &3 , ..., are any digits. 

When we write the number ... < 22^1 < 20-^1 ^2^3 •.. it means-|- <22 • 10 2 + a\ • 10 1 T 

ao • 10 ° + b\ • 10 -1 + Z ?2 * 10 -2 + £3 • 10 -3 + • • • or in sigma notation, 

. io ; + • 10 -;. 

i >0 j >0 


There is no special reason for choosing ten. If the base b were used, we might 
use 243 to mean “twice /^-squared plus four times b plus three.” We shall refer to this 
as an expression in base b. To avoid confusion, it will be necessary to write down the 
base. We shall write (243)^ to mean the number in base b, so that 

(243 ) b = 2 b 2 +4/7 + 3. 


Similarly, 

(0.73)* = lb~ l + 3 b~ 2 . 

When no subscript is used, the usual base (base 10) is intended. Another com¬ 
mon notation when the base is 2 is to write B after the number because num¬ 
bers written in base 2 are called binary numbers. So 101.115 means the same as 
( 101 . 11 ) 2 . 

To write regular (base 10) numbers, ten digits are used, but when we write binary 
numbers, only the two digits 0 and 1 are necessary. Similarly, in base b, we need b 
digits. If b is greater than 10, some new symbols must be invented—for example, in 
the base 16, which is called hexadecimal and is often used in computer applications, 
A , 5, C, D , E , and F are used for 10, 11, 12, 13, 14, and 15. 


Change of Base 

Sample Problem 1.12. Convert (243)7 and (104)5 to base 10. 


(243)v = 2-7 2 + 4-7 + 3 
= 98 + 28 + 3 
= 129; 


Solution. 
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(104)5 = 1 - 5 2 + 0-5 + 4 
= 25 + 4 
= 29. 

Practice Exercise. What are (144)5, (203)7, (112)3? 

The process is just the same for nonintegers, as the following sample problem 
shows. 

Sample Problem 1.13. Convert (0.231)5 and (104.231)5 to base 10. 

Solution. 

(0.231)5 = 2 • 5 _1 + 3 • 5“ 2 + 1 • 5“ 3 
= 2 • 0.2 + 3 • 0.04 + 1 • 0.008 
= 0.4 + 0.12 + 0.008 
= 0.528. 

Using this and the preceding sample problem, 

(104.231)5 = (104)5 + (0.231)5 
= 29 + 0.528 
= 29.528. 

Practice Exercise. What are (0.242)5, (144.242)5? 

To convert from base 10 to another base, use continued division. 

Sample Problem 1.14. What is 108 in base 1? 

7 108 

Solution. 7 15 + 3 . So 108 = (213) 7 . 

2 + 1 

Practice Exercise. What is 54 in base 5? What is 103 in base 6? 


To discuss the conversion of nonintegers from another base to base 10, look again 
at the second sample problem. Say * = (0.231)5. Then 5x = 2 + (0.31)5. So when 
we multiply by the base (in this case 5), the integer part of the result is the first digit 
of the expansion. Then 


5x — 2 = (0.31)5, 

5(5*-2) = 3 + (0.1) 5 . 

So multiplying the remainder by the base gives the second digit, and so on. 





1.3 Bases 


17 


Sample Problem 1.15. Express 0.71875 in base 2. 

Solution. 


2 - (0.71875) 
2 • (0.4375) 
2 • (0.875) 
2 • (0.75) 
2 • (0.5) 


1.4375, first digit 1, 
0.875, second digit 0, 
1.75, third digit 1, 

1.5, fourth digit 1, 
1.0, fifth digit 1. 


So 0.71875 = (0.10111)2 = 0.101115. 


Practice Exercise. Express 0.40625 in base 2. 

Just as in base 10, some numbers have terminating expressions and others recur. 
See the next sample problem. 

Sample Problem 1.16. Express 33.7125 in base 4. 

4k33_ 

Solution. 4 8 + 1, so 33 = (201)4. 

2 + 0 


4-0.7125 = 2.85, 

first digit 2, 

4-0.85 = 3.4, 

second digit 3, 

vq 

II 

d 

third digit 1, 

4 • 0.6 = 2.4, 

fourth digit 2, 

II 

d 



At this point we have been asked to repeat an earlier calculation, so the whole 
process will recur: 

0.7125 = (0.231212121212.. .) 4 = (0.23l2) 4 . 

Adding, 33.7125 = (201.23l2) 4 . 

Practice Exercise. Express 53.12 in base 6. 

Conversion between binary and hexadecimal numbers is particularly easy. Each 
hexadecimal number can be expressed as a four-digit binary number, provided lead¬ 
ing zeros are included. The conversion table is as follows: 





18 


1 Properties of Numbers 


0i6 — OOOO2, 
lie = 0001 2 , 
2i6 = OOIO2, 

316 =00112, 


4i6 = OIOO 2 , 

516 = 0101 2 , 
616 = 01102 , 
716 = 01112, 


8 16 = IOOO 2 , 
916 = 10012, 
Ai 6 = 10102, 
^16 = 10112 , 


Ci6 = 1100 2 , 
Di6 = 11012, 
^16 = 1110 2 , 
^16 = Hll2. 


To change hexadecimal to binary, simply use the table to make replacements, and 
then delete the excess zeros. To convert binary to hexadecimal, first add zeros at both 
ends to make the number of symbols before the decimal point and the number after 
both multiples of 4 and use the table again. 


Sample Problem 1.17. Convert 3A04.A4from hexadecimal to binary. 


Solution. Replacing term by term we get 

0011 1010 0000 0100.1010 0100, 


so the answer is 


11 1010 0000 0100.1010 01 . 


(Notice the use of spaces to make the numbers more readable.) 


Practice Exercise. Convert 553.76 from hexadecimal to binary. 
Sample Problem 1.18. Convert 10 IIOI.III 2 to hexadecimal. 
Solution. 10 IIOI.III 2 = 0010 1101.1110 2 = 2 D.E. 


Practice Exercise. Convert 10 1101 IOII.OI 2 to hexadecimal. 

Exercises 1.3 


In Exercises 1 to 39, express the number in base 10. 


1. 

110111 2 . 

2. 1314 7 . 

3. 

CT\ 

00 

0 

<N 

4. 

IOIIOI3. 

5. 6443 8 . 

6. 

1100112 

7. 

IIIIIIII2. 

8. 102 7 . 

9. 

115 9 . 

10. 

10101 2 . 

11. 121213. 

12. 

00 

0 

13. 

00 

0 

<N 

14. 2315. 

15. 

11.3 5 . 
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16. 17.46 9 . 

17. 1.1 3 . 

18. IIOIII.OI 2 . 

19. 22 . 67 . 

20. 0.04 8 . 

21. 210.04 8 . 

22. 101.T4 6 . 

23. 32.03 6 . 

24. IOOOIOOO.OOOI 2 . 

25. 141.22 5 . 

26. 50.216. 

27. 0.01 2 . 

28. 0.101 2 . 

29. 10101.101 2 . 

30. II. 2 I 3 . 

31. 112.2 3 . 

32. 2.031 4 . 

33. 101.31 6 . 

34. l.I 6 . 

35. 1011.1 5 . 

36. 44041.22 8 . 

37. II. 24 . 

38. 2.44 8 . 

39. 7.41 9 . 

In Exercises 40 to 54, express the number in base 2. 


40. 253. 

41. 43. 

42. 0.2. 

43. 55.75. 

44. 8. 

45. 14.83. 

46. 228. 

47. 126. 

48. 13.125. 

49. 111. 

50. 0.25. 

51. 126.25. 

52. 12.45. 

53. 1819. 

54. 517.8. 

In Exercises 55 to 69, express the number in base 6 . 


55. 4.1. 

56. 1011. 

57. 11.735. 

58. 2.2. 

59. 0.3. 

60. 75.8. 

61. 119. 

62. 51.4. 

63. 1104. 

64. 3.205. 

65. 14.4. 

66 . 112.2. 

67. 26.3. 

68 . 77. 

69. 28.4. 

In Exercises 10 to 84, express the number in hexadecimal (base 16) using the symbols 
0, 1, ...,9 ,A,B,C,D,E,F. 

70. 723. 

71. 108. 

72. 88.91. 

73. 11.5. 

74. 7.16. 

75. 14.03. 

76. 2.12. 

77. 255. 

78. 104. 

79. 27. 

80. 10.5. 

81. 1.12. 
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82. 7.8. 

83. 64.6. 

84. 257. 

In Exercises 85 to 92, convert the binary number to hexadecimal. 

85. 1001 0010 0100. 

86. 

1001.1101 101. 

87. 101 1010 1101.11. 

88. 

o 

o 

o 

p 

89. 101.0010 1110 Oil. 

90. 

1.0110 111. 

91. Ill 0110 1011.01. 

92. 

i oioi.noo llol. 

In Exercises 93 to 100, convert the hexadecimal number to binary. 

93. 1A01. 

94. A.0B. 

95. 1101. 

96. 912.£5. 

97. AE.EE. 

98. 213 F.EE. 

99. 1£4.A. 

100. 5EE.C. 



1.4 Scientific Notation 

Floating Point Numbers 

It is common to write very large or very small numbers in scientific (or exponential) 
notation—as an example, two million million million million is written as 2 x 10 24 , 
rather than 2 followed by 24 zeroes. There can be various numbers of digits before 
the decimal point—for example, 24.53 can be written as 0.2453 x 10 2 or 2453 x 10 -2 , 
or even 24.53 x 10°. In any of these expressions we refer to the first number as the 
mantissa and the power as the exponent —so 2.453 x 10 3 has mantissa 2.453 and 
exponent 3. The part of the mantissa to the right of the decimal point is called the 
fraction. All computers use some form of exponential notation to represent noninte¬ 
gers. 

Floating point notation is a special form of scientific notation. The mantissa has a 
fixed number of digits, which we shall call the length , and the exponent is chosen so 
that there is exactly one digit to the left of the decimal point. If calculations are being 
carried out in base 10, the absolute value |/| of the mantissa satisfies 1 < \ f\ < 10. 
A number whose mantissa satisfies this equation is called normalized —for example, 
the normalized form of 211.7 is 2.117 x 10 2 . The forms 211.7 x 10° and 0.2117 x 10 3 
will not be used—in the first one the mantissa is too large and in the second it is 
too small. The only exception to the rule is the number zero, which has 0 as its 
mantissa. One can also use floating point notation in bases other than 10—in base 2, 
a normalized number would have the form f x 2 e , where 1 < \ f\ < 2; unless we 
say otherwise, we assume the base is 10. 
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An alternative convention that is widely used is to require the mantissa / to 
satisfy 0.1 < | /1 < 1, so that the part before the decimal point is 0 and the first digit 
after the point is nonzero. We shall not use that method here, but you may encounter 
it in other books. 

Sample Problem 1.19. In a floating point system of length 4, what are the rep¬ 
resentations of 12.34, 101.2, —0.0012? 

Solution. 1.234 x 10 1 , 1.012 x 10 2 , -1.200 x 10“ 3 . 

Practice Exercise. In a floating point system of length 4, what are the represen¬ 
tations of 17.5, -11.12, 1.401? 

Rounding and Dropping Digits 

Sometimes the number of digits in the mantissa of a number is greater than the 
number of digits allowed by the floating point system. For example, in a floating 
point system of length 4, how are we to write the numbers 101.73 and 21.468? We 
would represent the first as 1.017 x 10 2 and the second as 2.147 x 10 1 . In the first 
case, where the last digit was less than 5, we rounded down, and ignored it; in the 
second case, where it was greater, we rounded up and added 1 to the second to last 
digit. In the extreme case, 123.99 would become 1.240 x 10 2 —rounding up 9 yields 
10 (“carry the 1”). In the middle, with a 5, one rounds up. 

If you check by using several calculators, you will find that some of them round 
up and down according to the above rules, whereas others simply ignore the last 
digit—this is called dropping. For example, if the exact answer to a calculation is 
27.73358, and your calculator only shows six digits, it might give you 27.7335, or 
2.77335 x 10 1 in floating point form. (Usually the more expensive calculators round, 
the cheaper ones drop!) 

In the case of negative numbers, one rounds or drops on the absolute value. To 
round —2.3149 to length 4, first look at the absolute value 2.3149, which rounds to 
2.315. So -2.3149 rounds to -2.315. 

Sample Problem 1.20. Write the following numbers in floating point form, of 
length 3. 


(1) 11.48. 

(2) 5.302. 


(3) -77.335. 

(4) 3/11. 


Solution. (1) 1.15x10*, (2) 5.30 x 10°, (3) -7.73 x 10*, (4) 2.73 x 10“ 1 (since 
3/11 = 0.2727...). 

Practice Exercise. Write the following numbers in floating point form, of 
length 4. 
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(1) -804.955. 

(2) 108.798. 


(3) 1144.114. 

(4) 4/13. 


Simplified Floating Point Arithmetic 

To illustrate how floating point numbers are used in calculations, we shall use a sys¬ 
tem in which the mantissa is limited to four places. Typical numbers are 2.123 x 10 3 , 
1.584 x lO -1 , -3.113 x 10 2 . 

If two numbers have the same exponent, they are added by adding the mantissas. 
If the two exponents are different they must first be adjusted by increasing the smaller 
one. 

Sample Problem 1.21. Find 8.348 x 10 3 + 2.212 x 10 2 . 

Solution. First adjust the exponents: instead of 2.212 x 10 2 use 0.221 x 10 3 . 
(Notice that one digit is rounded; this is truncation.) Then add the mantissas: 

8.438 + 0.221 = 8.569. 

So the answer is 8.569 x 10 3 . 

Practice Exercise. Find 1.043 x 10 2 + 3.223 x 10 3 . 

Sample Problem 1.22. Find 7.124 x 10 -2 + 6.004 x 10 -2 . 

Solution. 7.124 + 6.004 = 13.128. Since 13.128 x 10 -2 is not normalized, 
divide by 10 (and add 1 to the exponent). 

13.128 x 10“ 2 = 1.3128 x 10 -1 , 

and the answer is 1.313 x 10 “ 1 . Notice that, again, a place is lost by truncation, 
and rounding to the nearer four-digit mantissa occurs. 

Practice Exercise. Calculate in floating point arithmetic: 

6.041 x 10 2 + 3.303 x 10 3 ; 

7.007 x 10“ 4 +4.644 x 10“ 4 . 

Sometimes an adjustment of the exponent can have extreme results. For example, 
in the addition 

1.753 x 10 3 + 4.004 x 10“\ 

the exponent required is 3. When 4.004 x 10“ 1 is adjusted to have exponent 3, the 
result is 0.000 x 10 3 . 

Multiplication is carried out by multiplying mantissas and adding exponents— 
normalization may then be necessary. 
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Sample Problem 1.23. Multiply (5.045 x 10 2 ) x (2.123 x 10 3 ). 

Solution. 5.045x2.123 = 10.710535. So the required product equals 10.710535 x 
10 5 . In normalized form the answer is 1.071 x 10 4 . 

Practice Exercise. Multiply 4.640 x 10 2 by 3.020 x 10 4 . 

Exercises 1.4 

1. In each case identify the mantissa and the exponent. Then write the number in 
floating point form, of length 3. 

(i) 104.53 x 10 4 , (ii) -11 x 10“ 3 , 

(iii) 11.11 x 10- 5 , (iv) 104.3 x 10 7 . 

2. In each case write down the mantissa and the exponent. Then write the number 
in floating point form, of length 4. 

(i) 72.11 x 10 4 , (ii) -423 x 10“ 2 , 

(iii) 1104.7 x 10 3 , (iv) 104.33 x 10“ 3 , 

(v) 41.147 x 10 3 , (vi) 75.33 x 10 3 . 

In Exercises 3 to 20, express the number in floating point form, of length 5. 


3. 

117.4. 

4. 

48.1179. 

5. 

3 

7* 

6. 

-23.08. 

7. 

3834.992. 

8. 

-56.44. 

9. 

-14.7909. 

10. 

224.113. 

11. 

-21.2. 

12. 

0.011212. 

13. 

208.3. 

14. 

1/3. 

15. 

-15.55. 

16. 

107.986. 

17. 

-73.48. 

18. 

108.10888. 

19. 

1122.99. 

20. 

521.713. 


In Exercises 21 to 36, carry out the additions and subtractions in a floating point 
system of length 5. 

21. 3.2084 x 10 3 + 2.7941 x 10 3 . 

22. 1.1081 x 10 2 +4.9392 x 10 3 . 

23. 7.5493 x 10 3 + 8.7143 x 10 3 . 

24. 7.4083 x 10 2 - 3.9214 x 10 2 . 

25. 6.4132 x 10 3 - 3.1102 x 10 2 . 

26. 7.8101 x 10 3 - 6.4103 x 10 4 . 
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27. 7.2163 x 10“ 3 - 8.5123 x 1(T 3 . 

28. 8.6141 x 10" 1 - 2.7814 x 1(T 2 . 

29. 1.4083 x 10 2 + 7.7411 x 10 2 . 

30. 2.7174 x 10 3 + 1.3908 x 10 2 . 

31. 7.4192 x 10 2 + 6.3142 x 10 2 . 

32. 3.6131 x 10 4 - 2.1142 x 10 4 . 

33. 4.2401 x 10 4 - 7.1432 x 10 3 . 

34. 5.8974 x 10 3 - 2.8904 x 10 4 . 

35. 7.1384 x 10“ 3 +7.2241 x 10“ 3 . 

36. 7.8418 x 10“ 3 - 6.6616 x 10“ 4 . 

In Exercises 37 to 44, carry out the multiplication in a floating point system of 
length 3. 

37. Multiply 1.14 x 10 2 by 1.48 x 10 2 . 

38. Multiply -1.73 x 10 2 by 2.18 x 10 3 . 

39. Multiply -2.85 x 10 5 by 1.17 x 10“ 4 . 

40. Multiply 5.13 x 10 4 by 1.16 x 10“ 5 . 

41. Multiply 7.20 x 10 -2 by 1.08 x 10 2 . 

42. Multiply 4.44 x 10 2 by 1.13 x 10 3 . 

43. Multiply -3.14 x 10 3 by 1.13 x 10 4 . 

44. Multiply -5.92 x 10 2 by 8.14 x 10“ 3 . 

1.5 Arithmetic in Computers 

Storing Numbers in Computers 

There are two facts about computers that you should bear in mind when thinking 
about how computers store and use numbers. First of all, a computer uses binary 
arithmetic because a computer recognizes two states—on or off, electricity flowing 
or electricity not flowing. (There are exceptions to this rule, but the computers you 
will see about you are binary.) The computer converts your input (in ordinary decimal 
notation) to binary before doing any arithmetic. 

Second, the computer is limited in size. It cannot arbitrarily decide to give more 
digits in an answer the way a human being can. A number might, for example, be 
restricted to 8 binary digits, or 16, or 64. The degree of restriction is usually built into 
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the computer hardware. Binary digits are called bits, so we refer to 8-bit, or 16-bit, 
or 64-bit numbers. 


Integers 

We shall illustrate integer arithmetic in a computer using 8-bit numbers, although 
in practice the range of numbers available in a computer is much larger. So a num¬ 
ber like 53 is stored as 00110101. (This is because 53 = 11010l2;the computer 
has enough room to put in 8 bits for each number it processes, so it includes two 
extra zeros at the beginning, called “leading zeros”.) To store the negative —53, the 
computer first calculates what is called the complement (or one’s complement) of 
00110101 by changing every 0 to a 1 and every 1 to a 0, so it calculates 11001010. 
Then it calculates the two’s complement by adding 1 to the one’s complement. So the 
two’s complement is 11001011. This is used to represent —53. In the remainder of 
this section we shall show you how this “two’s complement” representation is used. 

Sample Problem 1.24. In an 8 -bit computer, how are 83 and —22 represented? 

Solution. 83 = IOIOOII 2 . Since it is positive, the representation is just 01010011 

Since —22 is negative, the two’s complement is used. First, 22 = 101IO 2 . Com¬ 
plete to eight places by attaching leading zeroes, getting 00010110. The one’s 

complement is 11101001, so the two’s complement is 11101010. 

Practice Exercise. In an 8-bit computer, how are 46 and —51 represented? 

In two’s complement arithmetic, the positive numbers all have first digit 0 and 
the negative numbers all have first digit 1. The largest positive integer that can be 
represented in an 8-bit computer is therefore OIIIIIII 2 = 127. This is not a great 
restriction in real-world computers—for example, if 64 bits are available, the largest 
possible positive integer would be greater than 9 million million million. In a calcu¬ 
lation there is no need to keep the exact value of such numbers and approximations 
are used. 

Sample Problem 1.25. In an 8 -bit computer, what numbers are represented by 

01101010 and 11010111? 

Solution. 01101010 represents the positive number 

IIOIOIO 2 = 2 6 + 2 5 + 2 3 + 2 


= 32 + 8 + 2 
= 42. 
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Since its first digit is 1, 11010111 represents a negative number. To compute it, 
reverse the construction process: subtract 1 to get 11010110 and then comple¬ 
ment to get 00101001. So the number represented is —10100 12 . 

101001 2 = 2 5 + 2 3 + l 
= 32 + 8 + 1 
= 41. 


So —41 is represented. 

Practice Exercise. In an 8-bit computer, which numbers are represented by 
01111001 and 10101101? 

Sample Problem 1.26. Use two’s complement arithmetic to calculate 53 — 14. 

Solution. 53 = IIOIOI 2 and 14 = IIIO 2 . So the representation of 53 is 
00110101 and that of 14 is 00001110. Thus 14 has one’s complement 11110001. 
So the two’s complement is 11110001 + 1 = 11110010, and —14 is represented 
by 11110010. Now add 

00110101 53 

11110010 -14. 

100100111 39 ^ 

The first digit (carry digit ) is ignored, as it takes us outside the eight-digit block 
available for integers. So the answer is represented by 00100111 which equals 
IOOIII 2 : 


100111 2 = 25 + 22 + 2+l 
= 32 + 4 + 2+1 
= 39, as expected. 

Practice Exercise. Use two’s complement arithmetic to calculate 8 — 4. 

The method just described will give an error when the sum is too large. For 
example, if we attempt to add 84 + 65, we get 

01010100 
01000001, 

10010101 

which represents a negative number, instead of the expected answer 149. The prob¬ 
lem is that 149 is larger than the maximum integer 127. This condition is called 
overflow. The computer has a way to detect this problem, which is beyond our scope 
here. 
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Because the computer is finite, any system of arithmetic will need a device to 
detect overflow. There are no other problems in two’s complement arithmetic; there 
are no “special cases” to be considered. Moreover, subtraction and addition are car¬ 
ried out in essentially the same way. There is only one form of “overflow.” (In some 
systems, different methods might be needed to check for positive and negative over¬ 
flow.) This is why two’s complement arithmetic is used in computers. 


Floating Point Arithmetic on Computers 

When large numbers are involved, or when an answer that is correct to a certain 
number of decimal places will satisfy our needs, we use floating point arithmetic. 
For the same reason, computers use floating point arithmetic, or a modified form of 
it. The floating point systems in computers utilize normalized floating point numbers 
in base 2. 

Historically, many types of floating point numbers have been used in computers. 
In the late 1970s the Institute of Electrical and Electronics Engineers developed a 
standardized format, which they published as IEEE Standard 754 (a “format” is a 
precise way of representing the number on the computer). This is the most common 
version found nowadays; it is the one used in most PCs. 

The IEEE15A format comes in three versions: single precision (32 bit), double 
precision (64 bit), and extended precision (80 bit) numbers. We shall describe single 
precision numbers; the others are similar. 

Each number is represented by a string of 32 binary digits. The first digit is the 
sign bit, and is 0 for nonnegative numbers and 1 for negative numbers. The next eight 
digits are used for the exponent, and are collectively called the exponent part. The 
final 23 digits are for the mantissa. 

The exponent is stored in “excess 127” form: if the exponent is e, the number 
stored is the binary equivalent of 127 + e. In eight binary digits one can represent all 
integers from 0 to 255; however, for technical reasons the numbers 0 and 255 are not 
used, so the numbers range from 1 to 254. These represent exponents in the range 
from —126 to 127. So the range of numbers that can be represented is from 2 -126 to 
2 127 , or approximately 10 -38 to 10 38 . 

The digit before the decimal point in the mantissa is nonzero. Since IEEE15A 
numbers are written in base 2, this means that the digit before the point in the man¬ 
tissa will always equal 1. To save space, the leading 1 is omitted and only the fraction 
is shown. The mantissa is length 24, but only the last 23 digits are recorded. If the 
mantissa is longer than 24 bits, it is truncated. 

Sample Problem 1.27. (a) Represent —0.09375 in IEEE1 54 format, (b) Convert 

0.1 to IEEE1 54 format. 


Solution, (a) First, convert 0.09375 to binary. 
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2 x 0.09375 = 0.1875, 

2 x 0.1875 = 0.375, 

2 x 0.375 = 0.75, 

2 x 0.75 = 1.5, 

2 x 0.5 = 1, 

so the binary form is 0.0001 12 , or 1.1 x 2 -4 . The exponent is —4, which has 
excess 127 form 123. Now 123 = IIIIOII 2 . So the exponent is stored as 
01111011. The sign bit is 1. So the IEEE15A form is 

1011 1101 1100 0000 0000 0000 0000 0000. 

(b) We now convert 0.1 to binary. 

2 x 0.1 = 0.2, 

2 x 0.2 = 0.4, 

2 x 0.4 = 0.8, 

2 x 0.8 = 1.6, 

2 x 0.6 = 1.2, 

2 x 0.2 = 0.4. 

Clearly the expression now repeats: 

0.1 = 0.000lT 

= 1 .T 00 T x 2 -4 . 

The sign bit is 0. The exponent is again —4, which has excess 127 form 
111101 12 - So the IEEE15A expression is 

0011 1101 1100 1100 1100 1100 1100 1100. 

Practice Exercise. Convert to IEEE154 format: 

12.25, -0.3, 112.5. 

Because it is difficult to read long binary strings accurately, IEEE15A numbers 
are often written in hexadecimal form. For example, the answers to the above Sample 
Problem would be written as #CZ)00000 and 3 DCCCCCC, respectively. 

Sample Problem 1.28. What is the IEEE15A representation of— 5.421875 ? What 
is its hexadecimal form? 

Solution. The sign bit is 1 since the number is negative. Now 5 = 101 2 . To 
convert 0.421875, 




1.5 Arithmetic in Computers 


29 


2 x 0.421875 = 0.84375, 

2 x 0.84375 = 1.6875, 

2 x 0.6875 = 0.375, 

2 x 0.375 = 0.75, 

2 x 0.75 = 1.5, 

2 x 0.5 = 1, 

so 0.421875 = O.OIIOII 2 , and therefore 

5.421875 = IOI.OIIOII 2 = 1.01011011 2 x 2 2 . 

2 + 127 = 129 = 1000000 12 . The exponent part is 10000001, and the represen¬ 
tation is 


1100 0000 1010 1101 1000 0000 0000 0000. 

The hexadecimal form is C0AD8000. 

Practice Exercise. What is the IEEE15A representation of 0.190625? What is 
its hexadecimal form? 

Sample Problem 1.29. Suppose x has IEEE154 representation 4A2C4000. Ex¬ 
press x as an integer times a power of 2. 

Solution. The sign bit is 0 , so x is positive. The exponent is IOOIOIOO 2 , which 
equals 144 10 . The mantissa is 010110001 (followed by zeroes), so 

X = 2 144-127 X 1.0101100012 
= 2 17 x 1.0101 1000 1 2 
= 2 17 x (2~ 9 x 10 1011 0001 2 ) 

= 2 8 x 689 

(since 10 1011 0001 2 = 512+ 128 + 32+ 16+ 1 = 689). 

Exercises 1.5 

In Exercises 1 to 18, assume you have an 8 -bit computer using two's complement 
arithmetic. In each question, how is the given number represented? How would the 
representation be written in hexadecimal? 

1. 46. 2. -46. 3. 127. 


4. 0. 


5. -1. 


6. -3. 
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7. -126. 

8. 21. 

9. -14. 

10. 57. 

11. -57. 

12. -127. 

13. -128. 

14. -2. 

15. -4. 

16. -125. 

17. 17. 

18. -23. 

In Exercises 19 to 30, what numbers are represented by the given number in an 8 -bit 
computer using two’s complement arithmetic? 

19. 00000000. 

20. 11111111. 

21. 10001010. 

22. 10000110. 

23. 01110001. 

24. 01011111. 

25. 00001111. 

26. 11110000. 

27. 11000110. 

28. 10000010. 

29. 10000011. 

30. 01101111. 


Use two’s complement arithmetic to carry out the calculations in Exercises 31 to 38. 
31. 17 - 4. 32. 26 - 38. 

33. -4 - 12. 34. 38 - 14. 

35. 15 - 7. 36. 28 - 32. 

37. 44-4. 38. -16- (-29). 

What is the I EE El 5 A representation of the numbers in Exercises 39 to 46? Give both 
binary and hexadecimal forms. 


39. 19.625. 

41. 48. 

43. 27. 

45. -1033.75. 


40. -19.625. 
42. 0.1. 

44. 1033.75. 
46. -0.3. 


The hexadecimal numbers in Exercises 47 to 52 are in IEEE1 54 format. What real 
numbers do they represent (in base 10)? (Give your answer as an integer multiplied 
by a power of 2.) 

47. 02A80000. 48. 9350C000. 

49. 86£C0000. 50. 03400000. 

51. 43D00000. 52. 6A£80000. 

53. Suppose a computer uses two’s complement arithmetic and 16 bits are available. 
What are the smallest and largest integers that can be represented? What are 
the approximate answers if 32 bits and 64 bits are available? (Use scientific 
notation.) 

54. Can the string 10000000 ever arise in two’s complement form? If “yes” what 
does it represent? If “no” why not? 
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Sets and Data Structures 


Sets are of fundamental importance in mathematics. The very number systems that 
we studied in the preceding chapter were all discussed in terms of a set of numbers. 
Many problems of discrete mathematics can conveniently be expressed in terms of 
sets, especially finite sets. For this reason, we need to discuss the properties of sets 
and develop language to talk about them. 

Along with sets, it is appropriate to discuss elementary logic. We shall observe a 
parallel between the logic of propositions and set theory. 

Both mathematical logic and set theory are broad areas of mathematics, and we 
only discuss them briefly here; moreover our interest is in the discrete case, and we 
emphasize finite cases. The interested reader will find that there is a wide literature 
on both these topics, and they contain very deep problems. 


2.1 Propositions and Logic 

Propositions and Truth Tables 

We shall define a proposition to be a statement that has a well-defined truth value , 
that is, it is either true ( T ) or false (F). Some statements in English are not 
propositions—one example is matters of opinion, such as “I like apples”; these are 
not propositions. On the other hand, “it will rain on this day next year” is a proposi¬ 
tion: it is either true or false (we are not worried about whether or not we know the 
truth value, or even if it is possible to know it). 

Simple propositions, like “today is Tuesday” and “it is raining,” can be combined 
to form compound propositions, like “today is Tuesday and it is raining,” by using a 
connective (“and” in the example). The truth value of a compound proposition can 
be calculated once we know the truth values of the simple propositions from which 
it is formed, and the connective used to combine these simple propositions together. 
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P 

q 

P V q 

p Aq 

p 

~P 

T 

T 

T 

T 

T 

F 

T 

F 

T 

F 

F 

T 

F 

T 

T 

F 



F 

F 

F 

F 


Table 2.1. Truth tables of A, V 


The simplest connectives are “not,” “and,” “or,” denoted by ~, A, V respectively. 
If p and q denote propositions, then the proposition “not p” (denoted is true 
precisely when p is false, the proposition “p and q” (denoted p A q) is true precisely 
when p is true and q is true, and the proposition “p or q” (denoted p V q) is true 
precisely when p is true or when q is true or when both p and q are true. (This is 
the meaning called “inclusive or,” the usual usage in mathematics.) The truth values 
of these compound propositions are shown in Table 2.1. Such tables are called truth 
tables. 

Formally, ~ p is called the negation of p, p V q is the disjunction of propositions 
p and q , and p A q is the conjunction of p and q. 

Often alternate phrases are used. For example, we sometimes use “as well (as)” 
instead of “and.” In English, we often use “but” instead of “and” when one of the two 
propositions is negative. Both these connectives are represented by A: if p means 
“today is cold” and q means “today is sunny,” then p A q could be translated as 
“today is cold and sunny,” “today is cold but sunny,” or “today is cold as well as 
sunny.” 

Sample Problem 2.1. Let p denote the proposition “the sun is shining ” and q 
the proposition “the wind is blowing” Write expressions for “the sun is not shin¬ 
ing,” “the sun is shining and the wind is blowing,” and “the sun is shining but 
the wind is not blowing” 

Solution. ~p denotes “the sun is not shining,” p V q denotes “the sun is shining 
or the wind is blowing,” and p A q denotes “the sun is shining and the wind is 
blowing.” 

Practice Exercise. Write expressions for “the wind is not blowing,” “the sun is 
shining or the wind is blowing (maybe both),” and “the sun is not shining but the 
wind is blowing.” 

Sample Problem 2.2. Suppose p, q and r mean “Joseph is here,” “Nancy is 
here” and “Donna is here” Interpret p A ~q and (p A q) A r. 

Solution, p A ~q means “Joseph is here but Nancy is not;” (p A q) A r means 
“Joseph, Nancy and Donna are here.” 

Practice Exercise. In this situation, interpret (p A r) A ~q and q V r. 
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Table 2.2. Truth table of (p V ~q) A (q V ~p) 


We sometimes think of a truth table as showing the truth or falsity of different 
outcomes of an experiment or set of events, as the following sample problem shows. 

Sample Problem 2.3. Suppose one card is drawn from a standard deck. Let p 
represent the statement “the card is a heart ” and q represent “the card is an 
honor ” {ace, king, queen, jack or ten). For which draws are p A q and p V q 
true ? 

Solution. For p A q to be true, the card must be a heart and it must be an honor. 
So it is true when the draw is the ace, king, queen, jack or ten of hearts (5 cases), 
and is false for all other cards (the other 47 cases), p v q will be true for all 
thirteen hearts and all five honors in clubs, diamonds or spades (28 cases in all) 
and false in the other 24 cases. 

Practice Exercise. In the same situation, which draws make pA~q true? Which 
make p v ~q true? 

To find the truth table of a statement with several connectives, we work one step 
at a time. For instance, to find the truth table of 

(p V ~<?) A (q v ~p), 

we consider ~p, ~q, p v ~q, q v ~p, and finally the whole expression, as shown 
in Table 2.2. 

Sample Problem 2.4. Find the truth table of(p A q) v (q A ~r). 

Solution. 


P 

q 

r 

~r 

(P A?) 

(« q A ~r) 

(pA?)v(^A ~r) 

T 

T 

T 

F 

T 

F 

T 

T 

T 

F 

T 

T 

T 

T 

T 
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T T 
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Table 2.3. Truth tables of p —► q and p q 


Practice Exercise. Find the truth table of (p v (q v (~p A ~r))). 

Two other connectives we use frequently are p -> q, meaning “if p then q” or 
“p implies q” and p o q, meaning “/? if and only if q” or in other words “if p then 
q and if q then p.” These are called the conditional (—>) and the biconditional (*>). 
Their truth tables are shown in Table 2.3. The interpretation of “implies” is that, if 
there is never a case where p is true and q is false, then we count p -> q as true; this 
explains the last two lines of the table. 

Sample Problem 2.5. Find the truth table for 

(P A ~q) (q V p). 

Solution. We proceed in steps as before. The result is as follows. 


p 

q 

~q 

P A' 

<? v p 

(P A ~<?) -)• (?V p) 

T 

T 

F 

F 

T 

T 
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F 

T 
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Practice Exercise. Find the truth table for 

(p v ~q) ** (q p)- 


Tautologies, Theorems and Logical Equivalence 

The statement Sample Problem 2.5 is in fact a tautology. A compound statement is 
a tautology if it is always true, regardless of the truth values of the simple statements 
from which it is constructed. A statement that is always false is called a contradic¬ 
tion', a very simple example is p A ~p. Other statements that do not fall into either 
category are called contingent. 

One of the main aims of logical deduction is to establish tautologies. For exam¬ 
ple, what we call theorems in mathematics are actually tautologies. The word “theo¬ 
rem” usually denotes a tautology whose essential truth is not immediately obvious, 
so that some proof is required to establish it. 
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One very easy example is the fact that every integer is a rational number. This 
requires a proof with only one step: we need to observe that any integer v can be 
written as the ratio of two integers, namely x/l. The truth of the theorem does not 
depend on the value of the rational number v. 

Sample Problem 2.6. Show that p v ~(/? A q) is a tautology. 

Solution. We use the following truth table. 


P q _ (pAq) ~(pAj) p v ~(p A q) 

T T T F T 

T F F T T 

F T F T T 

F F F T T 


Practice Exercise. Show that (p A q) A v q) is a contradiction. 

If p -> q is a tautology, we say “p implies q? and write “p qV If p q is a 
tautology, we say that “p is equivalent to q” and write “p q” or “p = qT In order 
to prove that p = q, it is sufficient to prove that p and q have the same truth table. 


The Laws of Logic 

A number of theorems (tautologies) about propositions may be deduced from truth 
tables, and together they form an algebraic system that is called mathematical (or 
symbolic ) logic. (An alternative view is to take some of the tautologies as axioms, 
and deduce the truth tables for the standard connectives.) Some of them are very rem¬ 
iniscent of the usual arithmetical laws, with = taking the place of equality. Among 
these we have: 

Commutative laws: 


Associative laws: 


pW q 

= 

q v p. 


p Aq 

= 

q A p. 


p V (q V r) 

= 

(p V q) V r, 


p A (q A r) 

= 

(p Aq) A r. 


p V (q A r) = 

(P 

> 

< 

> 

r). 

(p A q) V r = 

(P 

v r) a (q v 

r ), 

p A (q V r) = 

(P 

< 

> 

< 

r), 

(pv q) Ar = 

(P 

> 

< 

> 

r). 


Distributive laws: 
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where each statement is true for all propositions p,q,r. This reminds us of the be¬ 
havior of addition and multiplication, except that only one pair of distributive laws 
holds for ordinary arithmetic. 

In view of the associative laws, stated above, we can simply write p V q V r 
whenever either pv (qvr) or (pv q)v r,is intended, and similarly p Aq Ar means 
either p A (q A r) or (p A q) A r. 

If Ms a proposition that is always true, and if q is always false, then p acts like 
an identity element for the operation A and q acts like an identity element for V: 

p At = p, pV f = p, 

for all p. This is like the behavior of 1 under multiplication or 0 for addition. There 
are also zero laws: 


pVt = t, p A f = f. 


for all p. This reminds us of 0 under multiplication, but there is no corresponding 
element for addition. Finally, there are two laws called de Morgan’s laws : 

~(P v q) = (~p) A (~<?), 

^(p Aq) = (~p) V (~g) 

for all p and q. 


Exercises 2.1 


In Exercises 1 to 12, find the truth table for t 

1. A q. 

2. 

3. p {p ^ q). 

4. 

5. pv ^ q). 

6. 

7. ~p V~q. 

8. 

9. (p A q) V r. 

10. 

11. p V (q V r). 

12. 


(p -* q). 
(~P V ~g). 
6 . pv (~p A q). 
(p Aq). 


13. Use the results of Exercises 7 to 12 to prove the following equivalences. 

(i) V~q <£> ~(p A q). 

(ii) (p A q) V r (p V r) A (q V r). 

(iii) p V (q V r) (p V q) V r. 

Prove the equivalences in Exercises 14 to 16. 

14. (p^q) O (~p V q). 

15. (p q) & (~p A ~q) V (p A q). 
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16. (p ^ q) & (~q ~p). 

17. Prove that (q -> /?) is not equivalent to (/? —> g) in general. 

Find the truth tables for the compound statements in Exercises 18 to 21. 

18. (p ^ q) ^ r. 19. p (<? r). 

20. (p r) ^ (q ^ r ). 21. (p V q) A~(p A #). 

22. Consider four possible definitions of a connective /? f q, given by the following 
table. 


p 

q 

Definition of \ 

A 

B 

C 

D 

T 

T 

T 

T 

T 

T 

T 

F 

F 

F 

F 

F 

F 

T 

F 

F 

T 

T 

F 

F 

F 

T 

F 

T 


Show that definition D, and only definition D , makes the statement “p\ pvq” 
a tautology. Also show that definition A is that of p A q, B is that of p q, C 
is that of g itself, and D is p -> g • 

23. Prove that the following statements are equivalent. 

(i) (/? -> q). 

(ii) (p A 

(iii) (p A ~q) ^ q. 

24. Prove that (p ^ q) ^ r and p -> (g -> r) are not equivalent. 

25. Prove the two commutative laws. 

26. Prove the two associative laws. 

27. Prove the two distributive laws. 

28. Prove de Morgan’s laws. 

29. Let p\/_q denote the compound statement “ p or q but not both,” which is often 
called exclusive or. Find the truth table for p V q. Compare it with Table 2.2 and 
Exercise 21. 

In Exercises 30 to 39, find the truth table for the given statement. 

30. (p p). 

31. (p -* ~/j). 

32. p A 

33. (p Aq) (pv q). 

34. (p^ q) ^ (p A 4 ). 
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35. ((p -+ q) ->■ (p A q)) V (~p)- 

36. (~/?) A (p V q) —>■ (~q). 

37. g —> (/? —> ?)• 

38. (pv q) ^ (p A <7). 

39. ~p ^ (q p). 

40. Find truth tables for the following propositions. Are any of them 
equivalent? 

(i) (p ^ q) A (~r 

(ii) r ^ ~p. 

(iii) p -> ~r. 

(iv) ~((~£ -> ~p) A (<? -> ~r)). 


2.2 Elements of Set Theory 

Sets 

We saw seta in Section 1.1. The notations M, Q, Z, Z + and Z* were introduced for the 
sets of real numbers, rational numbers, integers, positive integers and non-negative 
integers respectively. 

As we noted, a set can be finite or infinite. We saw that it is often convenient 
to specify a finite set by listing its elements between braces, but most infinite sets 
must be defined by explicitly stating the membership law. The set of all objects x for 
which the statement S(x) is true was written {x \ ^(x)}, or sometimes as {x : 5(x)}. 
For example, the set 77 of prime numbers could be denoted (trivially) as 

77 = {x | x is a prime}. 

The definition of a set does not allow for ordering of its elements, or for repeti¬ 
tion of its elements. Thus {1, 2, 3}, {1, 3, 2} and {1, 2, 3, 1} all represent the same set 
(which could be written {x | x e Z* and x < 3}, or {x e Z* | x < 3}). To handle 
problems that involve ordering, we define a sequence to be an ordered set. Sequences 
can be denoted by parentheses; (1, 3, 2) is the sequence with first element 1, second 
element 3 and third element 2, and is different from (1, 2, 3). Sequences may con¬ 
tain repetitions, and (1, 2, 1, 3) is quite different from (1, 2, 3); the two occurrences 
of object 1 are distinguished by the fact that they lie in different positions in the 
ordering. 

We defined the notation 


S £ S 
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to mean “5 belongs to 5” or “5 is an element of S,” and S' c T to mean S is a subset 
of T. If S c T we also say that T contains S or T is a superset of S, and write 
T 2 S. Sets S and T are equal, S = T, if and only if S c T and T c S are both 
true. We can represent the situation where S is a subset of T but S is not equal to 
T —there is at least one member of T that is not a member of S—by writing S C T. 

An important concept is the empty set, or null set, which has no elements. This 
set, denoted by 0, is a subset of every other set. 

In all the discussions of sets in this book, we shall assume (usually without both¬ 
ering to mention the fact) that all the sets we are dealing with are subsets of some 
given universal set U. U may be chosen to be as large as necessary in any problem 
we deal with; in most of our discussion so far we could have chosen U = Z or 
U = R. U can often be chosen to be a finite set. 

The power set of any set S consists of all the subsets of S (including S itself 
and 0), and is denoted by V(S): 

V(S) = {T :T a}. (2.1) 

The power set is a set whose elements are themselves sets. 

Sample Problem 2.7. Write down all elements of the power set of{ 1,2, 3}. How 

many elements are there ? 

Solution. There are eight elements: {1, 2, 3}, {1, 2}, {1, 3}, {2, 3}, {1}, {2}, {3}, 

and 0. 

Practice Exercise. Write down all elements of the power set of {x, y, z}- 


Operations on Sets 

Given sets S and T, we define three operations: the union of S and T is the set 
S U T = [x : v g S or v e T (or both)}; 

the intersection of S and T is the set 

S nr = {i:ig5 andx e T}\ 

the relative complement of T with respect to S (or alternatively the set-theoretic 
difference or relative difference between S and T) is the set 

S\T = {x : x g S andv ^ T}. 

In particular, the relative complement U\T with respect to the universal set U is 
denoted by T and called the complement of T. We could also write R\S = R H S, 
since each of these sets consists of the elements belonging to R but not to S. Hence 
we see that R c S if and only if R\S = 0. 
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Sample Problem 2.8. IfE is the set of all even integers, what are E U 77, E Pi 77, 

E\77, ZUZ+, Z*\77? 

Solution. 

E U 77 = {..., -8, -6, -4, 2, 0, 2, 3, 4, 5, 6, 7, 8, 10, 11,...}, 

E H 77 = {2}, 

E\77 = {..., -8, -6, -4, -2, 0, 4, 6, 8,...}, 

zuz + = z, 

Z*\77 = {0, 1, 4, 6, 8, 9, 10, 12, 14, 15, 16, 18,...}. 

Practice Exercise. What are Z\Z + , Z Pi Z + , (Z + \E) U 77? 

If two sets, S and T, have no common element, so that S P\ T = 0, then we say 
that S and T are disjoint. Observe that S\T and T must be disjoint sets; in particular, 
T and T are disjoint. If n sets Si, S 2 , • • •, S n are such that each pair of them are 
disjoint, so that 

Si P Sj for 1 < i, j < n and i j, 

then we say that these n sets are pairwise disjoint or mutually disjoint. By a partition 
of a set S we mean a collection of pairwise disjoint non-empty sets Si, S 2 , ..., S n 
whose union is S. 

In general, to prove that the set S is a subset of the set T, we start with the 
statement, “suppose x is any element of Sand finish with “therefore x is an element 
of 77’ To show that S and T are equal, prove both Kf and S D T . Another method 
of proving S = T is to work as follows. Find an exact description of the elements of 
S —something of the form “S is precisely the set of all elements x with the following 
properties ...,” and prove that this description is also precisely the description of 
elements of T . 

Sometimes proofs of the form “suppose x is any element of 5”’ are simpler if 
the argument is broken into two parts: first consider all elements x with a certain 
property, then all those without that property. For example, to prove that (R\S) U S = 
R U S, for any two sets R and S , first observe that if x is a member of S , then 
it belongs to both (R\S) U S and R U S. So we need only discuss x not in S. The 
elements of (R \ S) U S not in S are precisely the elements of R\S, while the elements 
of R U S not in S are the members of R not in S —precisely the same elements. So 
(R\S) US = RUS. 


Properties of the Operations 

We now investigate some of the easier properties of the operations U, D and \; for 
the more difficult problems, we shall introduce some techniques in the next section. 
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Union and intersection both satisfy idempotence laws : for any set S, 


su s = sn s = s. 


Both operations satisfy commutative laws; in other words 


SUT = TU S 


and 


s nr = t n s, 


for any sets S and T. Similarly, the associative laws 

RU(SUT) = (RU S)UT 


and 


r n (s n T) = (R n s) n t 


are always satisfied. The associative law means that we can omit brackets in a 
string of unions (or a string of intersections); expressions like (A U B) U (C U D ), 
((A U B)U C)U D and (A U (B U C)) U D , are all equal, and we usually omit all the 
parentheses and simply write AU5UCUD. But be careful not to mix operations. 
(A U B) f! C and A U (B D C) are quite different. Combining the commutative and 
associative laws, we see that any string of unions can be rewritten in any order: for 
example, 

(D U B) U (C U A) = C U (B U (A U £>)) = (A U B U C U D). 

Sample Problem 2.9. Prove that (AU£)nC = A U (B Pi C) is not always true. 

Solution. To prove that a general rule is not true, it suffices to find just one case 
in which it is false. This is called a counterexample. As an example we take the 
case A = M, B = Z, C = {0}. Then (AU B)HC = {0}, while AU (B HC) = M. 

The following distributive laws hold: 


R U (S n T) = (R U S) n (R U T); 
r n (s u T) = (R n s) u (R n T); 
(R U S)\T = (R\T) U (S\T). 


( 2 . 2 ) 

(2.3) 

(2.4) 


Sample Problem 2.10. Prove the distributive law (2.2). 

Solution. Suppose v e R U (S D T). It may be that v e R; in that case, both 
x e (R U S) and x e (R U T) are true (in fact, v e (R U A) is true for 
any set A), so x e (R U S) Pi (R U T). On the other hand, if x g R, then 
v g (S Pi T ), and v belongs both to S and to T. Now x e S =>► x e (R U S), and 
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x e T x e (R U T ), so x e (S’ Pi T) => x e (R U S) Pi (R U T). So in either 
case, 

xeRU(SHT) => X e (R U S) n (R U T), 
and R U (S D T) c (R U S) Pi (R U T). 

Conversely, suppose x e (R U S) D (R U T). If x e R , then certainly x e 
R U (5 D T). If x g R , then x e (R U S) => x e S, and x e (R U T) => x e T. 
So 

X e (R u S) n (R u T) =» xe(^nr) =>► x e RU (S nT) 
and (R U S) D (R U T) C R U (S H T). So the two sets are equal. 

Practice Exercise. Prove the distributive law (2.3). 

We have also the equation 

R\(S U T) = ( R\S ) H (R\T), (2.5) 

and the analogous 

R\(S HT) = ( R\S ) U (/?\r). (2.6) 

Sample Problem 2.11. Prove (2.5) from the definition. 

Solution. R\(S U T) consists of precisely those members of R that are not 
members of S U T, in other words those elements of R that do not belong to S or 
to T. That is, 

R\(S U T) = {x | x e R andx ^ S andx ^ T}. 

On the other hand, (R\S) consists of all the things in R that are not in S , and 
(R\S) fl (R\T) consists of all the things in R that are not in T ; the common 
elements of these sets are all the things in R but not in S and not in T, which is 
the same as the description of R\(S U T). 

Equation (2.5) can also be verified using the idempotence, associative and com¬ 
mutative laws. From 

R\(S U T) = {x | x g R and x $ S and x $ T] 


we have 

R\(SUT) = Rn (sn T) 

= (R D R) IT (S D T) ... idempotence 
= (R H S) P\ (R H T) ... associativity, commutativity 
= (R\S)n(R\T). 
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When we take the particular case where R is the universal set in (2.5) and (2.6), 
those two equations become de Morgan's laws : 

SUT = S n T, (2.7) 

Y7vF = SUT. (2.8) 


Exercises 2.2 

1. Suppose A = {a, b, c, d , e}, B = { a , c, e, g, /}, C = {c, /, /, e, o}. Write down 
the elements of 

(i) AUB. (ii) AHC. 

(iii) A\B. (iv) A U (B\C). 

2. Suppose A = {2, 3, 5, 6 , 8 , 9}, B = {1, 2, 3, 4, 5}, C = {5, 6 , 7, 8 , 9}. Write 
down the elements of 

(i) AH B. (ii) AUC. 

(iii) A\(B n C). (iv) (A U B)\C. 

3. Suppose A = {1, 2, 4, 5, 6 , 7}, 5 = {1, 3, 5, 7, 9}, C = {2, 4, 6 , 7, 8 , 9}. Write 
down the elements of 

(i) AUBUC. (ii) A U (B n C). 

(iii) A\C. (iv) A n (5\C). 

4. Suppose A = Z + , 5 = {—4, —2, 1, 3, 5, 7}, C = {x \ x 2 = 1}. Write down the 
elements of 

(i) (AHB)UC. (ii) AHBHC. 

(iii) C\A. (iv) A H (B\C). 

5. Consider the sets 

51 ={2,5}, 

5 2 = {1,2, 4}, 

5*3 = {1,2, 4,5, 10}, 

5 4 = {x e Z + : x is a divisor of 20 }, 

5 5 = {x e Z + : x is a power of 2 and a divisor of 20 }. 

For which i and j, if any, is Si c 5y? For which i and j, if any, is 5) = 5y? 

6 . In each case, are the sets S and T disjoint? If not, what is their intersection? 

(i) S is the set of perfect squares 1, 4, 9 ,..T is the set of cubes 1, 8 , 27,... 
of positive integers. 

(ii) S is the set of perfect squares; T = M\M + . 
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(iii) S is the set of perfect squares 1, 4, 9, .. T is the set 77 of primes. 

7. In each case, are the sets S and T disjoint? If not, what is their intersection? 

(i) S is the set of all multiples of 5; T is the set of all perfect squares. 

(ii) S is the set of all students in your class; T is the set of all students in your 
college. 

8. Prove the commutative and associative laws for U. 

9. Prove the commutative and associative laws for Pi. 

In Exercises 10 to 20, U is a universal set and S and T are any sets. Prove the given 
result. 

10. S U 0 = 5. 11. S U U = U. 12. S U S = S. 

13. s u s = u. 14 . snu = s. 15. s n 0 = 0 . 

16. s n s = s. 17. s pi s = 0. 18. (^nr)c s. 

19. K(5 U T). 

20. If 5 U T = U and S fl T = 0, then T = S. 

21. Prove the distributive law in (2.4). 

22. Prove de Morgan’s laws. 

23. Suppose the sets A, B, C, D, S are defined in terms of 0 as follows. 

A = {0}, B = {A}, C = {0, A}, 

D = {0, A, C}, 5 = {0, A, B , C, D}. 

Show that: 

(i) [x | v g S and v c D} = S; 

(ii) {x | v g S and x e D} = D. 

In Exercises 24 to 30, R , S' T are any sets, and U is the universal set. 

24. Prove: if R c S and 7? c T, then 7? c (S Pi T 7 ). 

25. Prove: if 7? c T and S c T, then (R U S) c 7\ 

26. Prove: if 7? c S, then 7?H7 c 5n7and7?U7 c 5U7. 

27. Show that R c S if and only if 7? Pi S = 0. 

28. Show that if SU7 7 = 0,thenS = T = 0 and that if SH7 7 = I/, then S = T = 77. 

29. Prove that R\(S\T) contains all members of R Pi T, and hence prove that 

(R\S)\T = R\(S\T) 

is not a general law (in other words, relative difference is not associative). 
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30. Show that the following three statements are equivalent: S c T, S U T = T, 
SHT = S. 

In Exercises 31 to 42, the sets A, B,C, D are defined as follows: A = {0}, B = {A}, 
C = {0, A}, D = {B}. Determine whether the given statement is true or false. 


31. 0 c A. 

32. 0 € A. 

33. 0 € B. 

34. 0 c B. 

35. Acs. 

36. A e B. 

37. ACC. 

38. AeC. 

39. B c C. 

40. SeC. 

41. BCD. 

42. B e D. 

2.3 Proof Methods in Set Theory 



Method of Truth Tables 

Set-theoretic propositions can often be proved using truth tables. The reasoning is as 
follows. To prove that sets S and T are equal, it suffices to show that, for any element 
v of the universal set, the statement “jc e S os e T” is a tautology. This can be 
proved (or disproved) using a truth table. 

To illustrate this, consider the distributive law 

DL\R U (S H T) = (R U S) D (R U T). (2.2) 

The truth table might look like the following 


R 

S 

T 

SOT 

RUS 

RUT 

RU{SC\T) 

(RUS)U(RU T) 

T 

T 

T 

T 

T 

T 

T 

T 

T 

T 

F 

F 

T 

T 

T 

T 

T 

F 

T 

F 

T 

T 

T 

T 

T 

F 

F 

F 

T 

T 

T 

T 

F 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

F 

T 

F 

F 

F 

F 

F 

T 

F 

F 

T 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 


In writing the table, it is implicitly assumed that a general element x is being consid¬ 
ered; the notation T (or F) under the name of a set means that the statement that x is 
a member of that set is true (or false). For example, the third line of the table could 
be interpreted as saying, “in all cases where x e R is true, x e S is false, and x e T 
is true, it is always false that x e S HT, true that x e R U S, true that x e RUT, 
true that x e R U (S Pi T) and true that x e (R U S) Pi (R U T)T 
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R 

S 

R 

RUS 

RFS 

(R\S) 

T 

T 

F 

T 

T 

F 

T 

F 

F 

T 

F 

T 

F 

T 

T 

T 

F 

F 

F 

F 

T 

F 

F 

F 


Table 2.4. Truth tables for R U S, R n S and (R\S) 


The statement (2.2) is a tautology provided the last two columns are equal, which 
is seen to be true. 

Compare this proof with the proof given in Sample Problem 2.2.2.10. 

In order to apply the truth table method, it is necessary to know the truth tables 
of the main binary operations. The truth tables for complement, union, intersection 
and relative complement are shown in Table 2.4. 

The method of truth tables can also be used to prove that one set is a subset of 
another. If you need to prove that R c S, then the last two columns will be labeled R 
and S , and it is required that no row has T in the R column and F in the S column. 

Sample Problem 2.12. Prove that AnCc(An5)UC. 

Solution. The truth value of x e A is opposite that of x e A, so the table is as 

follows. 


A 

B 

C 

A 

AFB 

AFC 

(A n 5 ) u c 

T 

T 

T 

F 

F 

T 

T 

T 

T 

F 

F 

F 

F 

T 

T 

F 

T 

F 

F 

T 

T 

T 

F 

F 

F 

F 

F 

T 

F 

T 

T 

T 

T 

F 

T 

F 

T 

F 

T 

T 

F 

T 

F 

F 

T 

T 

F 

F 

T 

F 

F 

F 

T 

F 

F 

F 


There is no line with T in the second-last column and F in the last, so inclusion 
is proved. 

Practice Exercise. Prove that An^nC c 5n(AUC). 


Venn Diagrams 

It is common, and useful, to illustrate sets and operations on sets by diagrams. A set 
A is represented by a circle, and it is assumed that the elements of A correspond to 
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Fig. 2.1. R U (S H T) = (R U S) Pi (R U T) 


the points (or some of the points) inside the circle. The universal set is usually shown 
as a rectangle enclosing all the other sets; if it is not needed, the universal set is often 
omitted. Such an illustration is called a Venn diagram. 

Here are Venn diagrams representing A U B, A H B, A and A\B; in each case, 
the set represented is shown by the shaded area. The universal set is shown in each 
case. 



A A\B 


Two sets are equal if and only if they have the same Venn diagram. In order to 
illustrate this, we again consider the distributive law 

DL1R U(SnT) = (RUS)n(RU T). (2.2) 

The Venn diagram for R U (5 D T) is constructed in the upper half of Figure 2.1, and 
that for (R U S) Pi (R U T) is constructed in the lower half. The two are obviously 
identical. 

In the first part of this section, we applied the method of truth tables (developed 
for use with propositions) to set identities. We can also apply the methods of set 
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theory to the analysis of propositions. If s is any proposition, we define S to be the 
set of all sets of circumstances in which proposition s is true; similarly we make 
proposition t correspond to set T. Then s O t is equivalent to S' = T, and the Venn 
diagram that shows the set equality also indicates the proposition equivalence. 

For example, the preceding Venn diagram illustration that 

r u (s n T) = (R u s) n (R u t) 

for all sets R , S and T may also be used to construct a proof of the distributive law 
for propositions, 

r V (s A t) (r V s) A (r V t). 

Sample Problem 2.13. Write down a statement involving propositions that can 

be proven by establishing the set-theoretic identity 

(R\S)\T = R\(S\T). 

Solution. We let r correspond to set R , and so on. As R\S corresponds to the 

proposition rA~5, the answer is 

((r A A 4>(rA A ~t)). 

To prove A c B, it is sufficient to show that the diagram for A contains no 
shaded area that is not shaded in B . We illustrate this idea with the problems from 
Sample Problem 2.12 and its associated Practice Exercise (but in reverse order). 

Sample Problem 2.14. Use Venn diagrams to illustrate that 
An B DC C B n (A U C). 


Solution. 




Practice Exercise. Use Venn diagrams to illustrate that 


Ancc(An£)uc. 
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In many cases, you may not be sure whether or not two expressions represent the 
same set. Often the best response is to construct the corresponding Venn diagrams. 
If the diagrams are different, this disproves the equality; if they are the same, the 
identity will be provable. 

As before, this type of set-theoretic proof can be applied to propositions. A proof 
that R c S serves as a proof that r —> s. Sample Problem 2.14 can be interpreted as 
a proof that 

aAbAc^bA(aVc) 
is true for any propositions a , b and c. 


Syllogisms and Venn Diagrams 

Sometimes we draw a Venn diagram in order to represent some properties of sets. 
For example, if A and B are disjoint sets, the diagram can be drawn with A and B 
shown as disjoint circles. If A c B, the circle for A is entirely inside the circle for B. 

In the classical study of logic, an argument is given in the form of a syllogism , 
a set of statements (the premises , or data) and a conclusion drawn from them. For 
example, consider the argument 

All big cities are near water; Alamagordo, NM is not near water; therefore 

Alamagordo is not a big city. 

To examine this, suppose B is the set of all big cities, 

W is the set of all cities near water, and a represents Ala¬ 
magordo. Since the premises tell us that B c W, we can 
draw the sets as shown. As a £ W, it must lie somewhere 
in the outside region, so it is certainly not in B. Therefore 
the argument is valid. 

Some arguments that look logical at first sight turn out not to be valid. For exam¬ 
ple, 



All big cities are near water; Carbondale, IL is near water; therefore Car- 
bondale is a big city. 


We label the sets as before. In the diagram shown, Car¬ 
bondale could be represented either by x or by y, so you 
can’t draw any conclusion. The argument is not valid. (In 
fact, x is nearer the mark.) 
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Sample Problem 2.15. Examine the argument: no students in this class are lazy; 
John is a math major; all math majors are lazy; so John is not a student in this 
class. 

Solution. Write L , C and M for the sets of lazy students, students in this class 
and math majors. The premises are represented in the following diagram. 



John is a member of M, which is disjoint from C. So John is not in this class, 
and the conclusion is valid. 

Practice Exercise. Examine the argument: some students in this class are lazy; 
all males are lazy; so some students in this class are males. 

Observe from the above example that the validity of an argument does not de¬ 
pend on the truth of the premises or conclusion. Not all math majors are lazy, and 
I wouldn’t venture to guess how many students in your class are lazy! Another ex¬ 
ample, where the premises and conclusion are all false but the argument is valid, 
appears in Exercise 2.3.26. 


Exercises 2.3 

In Exercises 1 to 8 , represent the set in a Venn diagram. 
1. RUSUT. 

3. fiu(snr). 

5. (R\S)DT. 

7. (R n S)\(S n T). 


2 . /iusur. 

4. (sn^)nr. 

6. RD(S\T). 

8. (R U T)\(S n T). 


9. Prove: R = ( R U S) U (R n S'). 


10. Find a simpler expression for S U ( (R U S) fl R). 

In Exercises 11 to 15, prove the rule using truth tables and illustrate it using Venn 
diagrams. 

11 . 5 0 5 = 0 . 


12. 5 U T = 5 O T. 


13. 5 0 T = 5 U T. 
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14. (5nr)c 

15. 5 c (5 u r). 

16. Use truth tables to represent the commutative and associative laws for U. 

17. Use Venn diagrams to represent the commutative and associative laws for D. 

18. For any sets R and S , prove R Fl (R U S) = R. 

19. Prove, using Venn diagrams, that (R\S)\T = R\(S\T ) does not hold for all 
choices of sets R, S and T. 

20. (i) Prove, without using truth tables or Venn diagrams, that union is not dis¬ 

tributive over relative difference: in other words, prove that the following 
statement is not always true: 

(R\S) UT = (RU T)\(S U T). 

(Hint: use the fact (R\S) U S = R U S.) 

(ii) Now prove this using Venn diagrams. 

21. Draw Venn diagrams for use in the following circumstances: 

(i) All my goldfish are tropical fish. 

(ii) None of my goldfish are tropical fish. 

In Exercises 22 to 25, test the validity of the argument hy drawing the appropriate 

Venn diagram. 

22. All men are mortal; Socrates is a man. Therefore Socrates is mortal. 

23. All my friends are students; none of my neighbors are students; Ruth is my friend. 
Therefore Ruth is not my neighbor. 

24. Boston is a big city; all big cities have department stores; Shirley lives in a city 
with no department store. Therefore Shirley does not live in Boston. 

25. All businessmen are wealthy; all mathematicians are cheerful; David is a busi¬ 
nessman; no cheerful people are wealthy. Therefore David is not a mathemati¬ 
cian. 

26. Show that the following argument is valid, although its premises and conclusion 
are all false: All expensive food contains cholesterol; steak contains no choles¬ 
terol. Therefore steak is not expensive. 

27. Show that the following argument is not valid, although its premises and con¬ 
clusion are all true: Some animals walk on two legs; human beings are animals; 
therefore human beings walk on two legs. 

28. Consider the data: All authors are solitary people; all physicians are rich; no 
solitary people are rich. Which of the following conclusions can be drawn? 

(i) No authors are rich. 

(ii) All physicians are solitary. 
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(iii) No one can be both an author and a physician. 

29. Consider the data: I sold back all my expensive textbooks last year; all my science 
textbooks are green; I did not sell back any green books last year. Which of the 
following conclusions can be drawn? 

(i) None of my science textbooks are expensive. 

(ii) All of my science textbooks were sold back last year. 

(iii) Some of my science textbooks were sold back last year. 

(iv) None of my science textbooks were sold back last year. 

(v) No green textbooks were sold back last year. 

30. Consider the data: All topcoats are expensive; none of my clothes are expensive; 
all expensive clothes are well made. Which of the following conclusions can be 
drawn? 

(i) I do not own a topcoat. 

(ii) All topcoats are well made. 

(iii) None of my clothes are well made. 


2.4 Some Further Set Operations 

Symmetric Difference 

Another important set operation is symmetric difference. The symmetric difference 
of A and B is the set 

A + B = {x:xeAovxeB but x $ A D B}. (2.9) 

Sample Problem 2.16. What is the truth table for A + B ? 

Solution. 

~A B A + B 

~T T F 

T F T 

FT T 
F F F 


Practice Exercise. What is the Venn diagram for A + B1 


This definition could be stated as 
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A + B = (A U B)\(A H B) 

= (A u B) n (A n B) 

= (AU B)n(AUB), (2.10) 

using (2.7). By the symmetry of the relation in (2.10), it follows that 

A + £ = A + B. 

From the definition, we may consider A + B to be the union of the difference between 
A and B with the difference between B and A. This implies that 

A + B = (A\B)U(B\A), 

and hence that 

A + B = (AUB)U(AnB). (2.11) 

If a and b denote the propositions 66 x e A” and “x e B” respectively, then 
the proposition “x e A + B ” is denoted by aw_b. (For the definition of V, see 
Exercise 2.1.29) Then 

<2 vZ? 44 ((<a V b) V ~(a a b)) 4^ (( a V b) A (~a V ~fe)) (2.12) 

is a restatement of (2.10), and similarly we see that 

(av_b) 44 (~a\/_~b) 


while (2.11) yields 

(i a\/_b ) 44 (a A ~Z?) V (~0 A Z?). (2.13) 

It is easy to see that symmetric difference satisfies the commutative law 

A + B = B + A. (2.14) 

The associative law is also true, but it is harder to prove, so we state it as a 
theorem. 

Theorem 7. Symmetric difference satisfies the associative law 
A + (B + C) = (A + B) + C. 

Proof. We start from (2.10), 

A + B = (AU B)H(AUB). 


Then 


A + B = (A U B) U (A H B) by (2.7) 
= (A H B) U (A H B). 


(2.15) 
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Fig. 2.2. Associativity of symmetric difference 


From (2.11), 

(A + 8) + C = [(A + B)nC]U [(A + B) n c] 

= {[(A n B) u (A n B)] n c} 

U {[(A n B) U (A n B)] n c} by (2.11) and (2.15) 

= {AnBnc}u|InBnc) 
u{ a n b n C} u {A n b n C}. 

In exactly the same way we may prove that 

£ + c) + A = {£ncnA}u{#ncnA} 
u{£ ncnA)u{5ncnA). 

Since union and intersection are commutative and associative operations, the right- 
hand sides of the last two equations are equal, so 

(B + C) + A = (A + B) + C; 

on applying the commutative law in (2.14) to the left-hand side, we obtain 

A + (B + C) = (A + B) + C. □ 

In view of Theorem 7, we can write A + B + C instead of (A + B) + C. In the 
expression 

{An5nc)u{An5nc)u(An5nc)u{An5nc) 

the first three terms are the sets of all elements belonging to exactly one of A, B 
and C, while the fourth term is the intersection of all three. So A + B + C consists 
of all those elements which belong to an odd number of the sets A, B and C ; see 
Figure 2.2, where A + B + C is represented by the shaded area. 

The proof of Theorem 7 using truth tables or Venn diagrams is left as an exercise. 


Cartesian Product 

We define the Cartesian product (or cross product) S x T of sets S and T to be the 
set of all ordered pairs (s,t) where s e S and t e T: 
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s xT = {0,0 :s e S,t eT}. 

There is no requirement that S and T be disjoint; in fact, it is often useful to consider 
SxS. 

The number of elements of S x T is | S\ x | T\ . (This is one reason why the symbol 
x was chosen for cartesian product.) 

Sample Problem 2.17. Suppose S = {0, 1} and T = { 1, 2}. What is S x T? 

Solution. SxT = {(0, 1), (0, 2), (1, 1), (1, 2)}, the set of all four of the possible 
ordered pairs. 

Practice Exercise. What is S x T if S = {1, 2} and T = {1, 4, 5}? 

The sets (R x S) x T and R x (S x T) are not equal; one consists of an ordered 
pair whose first element is itself an ordered pair, and the other of pairs in which the 
second is an ordered pair. So there is no associative law, and no natural meaning for 
R x S x T. On the other hand, it is sometimes natural to talk about ordered triples 
of elements, so we define 

R x S x T = {(r, s, t) : r e R, s e S,t e r}. 

This notation can be extended to ordered sets of any length. 

There are several distributive laws involving the cartesian product: 

Theorem 8 . If R, S and T are any sets, then 

(i) R x (S U T) = (R x S) U (R x T). 

(ii) Rx(SHT) = (Rx S)H(Rx T). 

Proof, (i) We prove that every element of R x (S U T) is a member of (R x S) U 
(R x T), and conversely. 

First observe that 

R x (S U T) = {(r, s) | r e R and s e S U T] 

= {(r, s) \ r e R and (s e S or s e 7 1 )}. 

On the other hand, 

(R x S) U (R x T) = {(r, s) \ (r, s) e R x S or (r, s) e R x T } 

= {(r, s) \ (r e R and s e S) or (r e R and 5Gl)} 

= {(r, s) | r g R and (s e 5 or s e T) }, 

where the last equality follows from the distributive law for propositions , applied to 
the propositions r e R, s e S and s e T. 
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It is now clear that (r, s) e R x (S U T) and (r, s) e (R x S) U (R x T) are 
equivalent. 

The proof of part (ii) is left as an exercise. □ 

Exercises 2.4 


1. Prove Theorem 7 

(i) Using truth tables; 

(ii) Using Venn diagrams. 

2. Prove the two distributive laws 

a n (B + c) = (A n B) + (A n c), 

(A + 5) n c = (A n c) + (B n c). 

(i) Using truth tables. 

(ii) Using Venn diagrams. 

3. Show that union is not distributive over symmetric difference. (Hint: consider 
the set S U (5 + T).) 

4. Show that S + T = 0 if and only if S = T. 

In Exercises 5 to 14, decide whether the given statement is true or false. Prove each 
of the statements that you think is true: give a counterexample for each statement 
that you think is false. 

5. R + (S n T) = (R + S) n (R + T). 

6 . R + (S\T) = (R + S)(R + T). 

7. R\(S nr) = (R\S) U (R\T). 

8 . 7? n (s + T) = (R n s) + (/? n T). 

9 . RU (S\T) = (R U S)\(fl U r). 

10. R + (S U T) = (R + S) U (R + T). 

11. /?\(5 ur) = (/?\s) n (/?\r). 

12. /?\(5 + T) = (R\S) + (R\T). 

13. 7? n (s\r) = (R n s)\(7? n t). 

14. 7? U (5 + T) = (R U S) + (R u T). 

15. In each case, list all elements of S' x T. 

(i) S = {1,2,3}; T = {1,4,5}. 

(ii) S = {x\x 2 =l}- T = {y \ y 2 = 4). 

(iii) S = {1,3, 5, 7}; T = {1,2,3}. 
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16. In each case, list all elements of R x S x T. 

(i) R = {1,2}; 5 = {3,4}; T = {5,6}. 

(ii) R = {12, 13, 14}; 5 = {1}; T = {1, 2, 3}. 

17 > (i) If S = 0, T ± 0, what isSxT? 

(ii) If S' x T = T x 5', what can you say about S and T? 

18. Prove the distributive law R x (S H T) = (R x S) H (R x T); 

19, (i) If A c S, B c T, show that Ax B c 5 xT. 

(ii) Find an example of sets A, B , S, T, such that A x B c S' x T and B T, 
but A g 5. 


2.5 Mathematical Induction 

The Principle of Mathematical Induction 

In working with finite sets or with the sets of positive integers and of integers, one 
repeatedly uses a technique of proof known as the method of mathematical induction. 
The general idea is as follows: suppose we want to prove that every positive integer 
n has a property P(n). We first prove P(l) to be true. Then we prove that, for any n , 
the truth of P(n) implies that of P(n+1); in symbols: 

P{n) true =>► P(n + 1) true. (2.16) 

Intuitively, we would like to say: 


P(l) true, 



P( 1) true => 

P( 2) true; 

P( 2) true 

P( 2) 

true, by (2.16), 

P (2) true => 
P( 3) true; 

P( 3) 

true, by (2.16), 


and so on. There is a difficulty, however: given any positive integer k, we can select 
an integer n such that the proof of P(n) requires at least k steps, so the proof can be 
arbitrarily long. As “unbounded” proofs present logical difficulties in mathematics— 
who could ever finish writing one down?—we need an axiom or theorem that states 
that induction is a valid procedure. This is the principle of mathematical induction , 
and may be stated as follows. 

Principle of mathematical induction Suppose the proposition P(n) satisfies 
(i) P{\) is true; and 
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(ii) for every positive integer n , whenever P(n) is true, then P(n + 1) is true. 

Then P(n ) is true for all positive integers n. 

This principle is sometimes called weak induction. Another form is as follows. 
Strong induction Suppose the proposition P(n) satisfies 

(i) P(\) is true; and 

(ii) for every positive integer n , if P(k) is true whenever 1 < k < n, then Pin) is 
true. 

Then P(n) is true for all positive integers n. 

At first sight, the second statement looks as though we have assumed more than 
in the first statement. However these two forms are equivalent; a detailed proof can 
be found in more advanced books. 

In the above formulations, the case n = 1—the case of P(l)—is called the base 
case. There is in fact nothing special about 1; any integer could be used to define the 
base case. For example, weak induction could be stated as 

Suppose the proposition P(n) satisfies 

(i) P(t) is true for some specified integer t\ and 

(ii) for every positive integer n > t, whenever P(n) is true, then P(n + 1) is true. 
Then P(n) is true for all integers n > t. 

Induction can also be stated in strictly set-theoretic form. Suppose S is the set of 
integers n such that P(n) is true. Then the principle of mathematical induction (weak 
induction form) is: 

Let S be a subset of Z + such that 

(i) 1 G 5; and 

(ii) whenever n e S, then n + 1 e S. 

Then S = Z+ 

One could equally well state the principle in terms of non-negative integers, in¬ 
stead of positive integers, by changing the case P(0) to F(l), and converting refer¬ 
ences from “positive integers” to “non-negative integers.” It can in fact be stated in 
terms of any starting point: if S is a set of integers for which 

(i) t e S , 

where t is any integer, and 

(ii) for every integer n > t, if n e S, then n + 1 e S, 

then the principle can be used to prove that S contains all integers equal to or greater 
than t. This form is sometimes called “induction from t .” 
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The Well-Ordering Principle 

Another principle that is very useful in proving results about sets of positive integers 
is: 


The well-ordering principle Suppose S is any non-empty set of positive integers. 
Then S has a smallest member. 

Any set with the property that every non-empty subset has a least member is 
called being well-ordered ; so the principle says “the positive integers are well- 
ordered.” Many number-sets, such as the real numbers, are not well-ordered. 

At first the result seems obvious. If S is not empty, then it must contain some 
member, x say. In order to find the smallest member of S , one need only check to see 
whether or not x — 1, x — 2, ...,1 are members of S , and this requires only a finite 
number of steps. However, this “proof” contains the same problem as the “proof” of 
the principle of mathematical induction: the starting value, x, can be arbitrarily large. 

In fact, the induction principle can be proved from the well-ordering principle. 
To see this, let us assume the well-ordering principle is true and suppose P is any 
proposition about positive integers such that P(l) is true, and for every positive 
integer n, whenever P(n) is true, then P(n + 1) is true. (These are the requirements 
for induction.) Write S for the set of positive integers n such that P(n) is not true. In 
order to prove that induction works, we need to show that S is empty. 

If S is not empty, then by well-ordering S has a smallest member, x say. So P(x) 
is false. We know that P( 1) is true, so x / 1. But x is a positive integer. So x — 1 is 
a positive integer, and P(x) must be true—otherwise x — l would be a member of S , 
and smaller than x. But this means P((x — 1) + 1) is true: that is, P(x) is both true 
and false! This can’t happen, so our original assumption, that S is not empty, must 
have been wrong. So induction is proved. 

We can also work the other way. If you assume induction is true, you can show 
that the positive integers are well-ordered. 

However, there is no absolute proof here. It is necessary to assume that either 
induction or well-ordering is a property of the positive integers. So we assume these 
as axioms about numbers. 


Some Applications 

We start by looking at some properties of sets and prove them by induction. 
Theorem 9. Let Abe a set with \A\ = n. Then \V(A)\ = 2 n . 

Proof. To apply induction, we can rephrase the statement as: Let P(n) be the state¬ 
ment “\V( A)| = 2 n for any n-element set Athen P(n) is true for all posi¬ 
tive integers n. Since the elements of A do not matter, we may as well assume 
A = [a — 1, < 22 , ..., a n }. 
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First we consider the case n = 1, so A = {a\}. Then V(A) = {0, A}, so 
\V(A)\ = 2 1 and in this case the theorem is true. Now suppose the result has been 
proved for ft = k — 1 and assume n = k; we might as well say A = A! U {ak} where 
A' = {a — 1, ci 2 , ..., By the induction hypothesis IT^A')! = 2 k ~ l . Any subset 

of A is either a subset of A! or a subset of A' with the element ^ adjoined to it, so to 
each subset of A' there correspond two subsets of A. Hence \V(A)\ = 2 k ~ l • 2 and 
the theorem is proved. □ 

We consider next the cardinality of a cartesian product of two finite sets. 
Theorem 10. If\S\ = m and \ T\ = n, then \S x T\ = mn. 

Proof. We proceed by induction on m. If m = 1, then S = {^i} and an ordered pair 
with si as its first element may be constructed in n ways, giving (s \, t\), (si, t 2 ),..., 
(s i, t n ), so the theorem is true for m = 1. Now suppose the statement is true for 
m — k — 1, and consider the case m = k. 

Let S' = {si, S 2 , • • •, S£_i}, so | S' x T\ = (k — 1 )n. Also, |{s&} x T\ = n. But 
S x T = (S' x T) U ({s£} x T) and since the two products on the right-hand side of 
this equation are disjoint, we know that |S x T\ = (k—l )ft + ft, proving the theorem. 

□ 

Mathematical induction is very often used in proving general algebraic formulas. 

Sample Problem 2.18. Prove hy induction that the sum of the first n positive 
integers is 

1 

1 + 2 H-1- ft = -ft(ft + 1). 

Solution. The case n = 1 is ^ -1(1 + 1) = 1, which is obviously true, so the 
formula gives the correct answer when n — 1. Suppose it is true when n = k — l; 
therefore 

1 + 2 + ••• + (& — 1) = — (k — 1 )k. 

Then 

1 + 2 + • • • + (k - 1) + k = ^(k - l)k + k 
= - (k 2 — k + 2 k) 

= ^*(*+ 1 ), 

and the formula is proved correct when n = k. So, by induction, we have the 
required result. 
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Fig. 2.3. How lines divide the plane 

Practice Exercise. Prove that the sum of the first n odd positive integers is 
1 + 3 + • • • + (2n — 1) = n 2 . 

The following example looks geometrical, but it also yields to induction. 

Sample Problem 2.19. Suppose n straight lines are drawn in two-dimensional 
space, in such a way that no three lines have a common point and no two lines 
are parallel. Prove that the lines divide the plane into ^0 n 2 + n + 2) regions. 

Solution. It will probably help if we first look at a small example. The left-hand 
diagram in Figure 2.3 shows the plane divided into seven regions A, B, ..., G 
by three lines. When a fourth line is introduced, it passes through four regions: 
it starts in D , which it divides into two regions B\ and B 2 ; it then crosses a line 
into B , which it divides into two; then it crosses into C, and E. Each time it 
crosses a line, it divided a region into two. 

Now for the formal induction. If n = 1, the formula yields ^(1 + 1 + 2) = 2, 
and one line does indeed partition the plane into two regions. Now assume that 
the formula works for ft = k — 1. Consider k — 1 lines drawn in the plane. From 
the induction hypothesis, the plane is divided into \{{k — l) 2 + (k — 1) + 2) = 
\{k 2 — k + 2) regions. 

Now insert a kth line. It must cross every other line exactly once, so it crosses 
(k — 1) lines, and lies in k regions (the one in which it started, and another after it 
crosses each line). It divides each of these regions into two parts, so the k regions 
are replaced by 2 k new regions; the total is 

1(£ 2 - k + 2) - k + 2k = i(£ 2 - k + 2 - 2k + 4k) 

— 2 (^ 2 ^ 2 ) > 


and the result is true by induction. 
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Here is an example that uses induction from the starting point 4, rather than from 
0 or 1. 

Sample Problem 2.20. Prove that n\ >2 n whenever n > 4. 

Solution. Suppose the proposition P(n) means n\ > 2". Then P(4) means 
4! > 2 4 , or 24 > 16, which is true. Now suppose k is an integer greater than 
or equal to 4, and P(k) is true: k\ > 2 k . Multiplying by k + 1, we have 
(it + 1)1 > ( 2 k (k + 1) > 2 k 2 = 2* +1 , so P(fr) implies P(fr + 1), and the 
result follows by induction. 

Practice Exercise. Prove that n 2 > 2ft + 1 whenever n > 3. 

Sample Problem 2.21. Prove fry induction that 5 n —2 n is divisible by 3 whenever 
n is a positive integer. 

Solution. Suppose P(n) means 3 divides 5 n — 2 n . Then P(l) is true because 
5 1 — 2 1 = 3. Now suppose k is any positive integer, and P(k) is true: say 
5 k — 2 k = 3x, where x is an integer. Then 5^ +1 — 2 k+l = 5 • 5^ — 2 • 2 k = 
3 • 5^ + 2 • 5^ — 2 • 2 k = 3 • 5^ + 2 • 3x, which is divisible by 3. So the result 
follows by induction. 

Practice Exercise. Prove that 3 2/1 — 2 n is divisible by 7 whenever n is a positive 
integer. 

The Fibonacci numbers /i, fi, / 3 , • •. are defined as follows. f\ = fa — 1, and 
if ft is any integer greater than 2, /„ = / w _i + f n -2- This famous sequence is the 
solution to a problem posed by Leonardo of Pisa, or Leonardo Fibonacci (Fibonacci 
means son of Bonacci) in 1202: 

A newly born pair of rabbits of opposite sexes is placed in an enclosure 
at the beginning of the year. Beginning with the second month, the female 
gives birth to a pair of rabbits of opposite sexes each month. Each new pair 
also gives birth to a pair of rabbits of opposite sexes each month, beginning 
with their second month. 

The number of pairs of rabbits in the enclosure at the beginning of month n is f n . 

Some interesting properties of the Fibonacci numbers involve the idea of con¬ 
gruence modulo a positive integer. We say a is congruent to b modulo ft, written 
“a = b (mod ft),” if and only if a and b leave the same remainder on division by ft. 
In other words ft is a divisor of a — b, or in symbols n \ (ft — b). As an exam¬ 
ple, both 15 and 39 leave remainder 3 on division by 12, so 15 = 39 (mod 12); and 
39 — 15 = 24 = 2 x 12, so 12 is a divisor of 39 — 15. This idea will be explored 
further in Sample Problem 4.6 and in Section 9.2. 
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Sample Problem 2.22. Prove by induction that the Fibonacci number f n is even 
if and only if n is divisible by 3. 

Solution. Assume n is at least 4. f n = /„_i + f n -i = (/*-2 + fn-z) + fn- 2 = 
fn—3 + 2/*_ 2 , SO /* = / n _3 (mod 2). 

We first prove that, for k > 0, fok is even. Call this proposition P(k). Then 
P(l) is true because fo = 3. Now suppose k is any positive integer, and P(k) 
is true: fck = 0 (mod 2). Then (putting ft = 3k + 3) / 3 (£+i) = f?>k (mod 2) = 
0 (mod 2) by the induction hypothesis. So P(k + 1) is true; the result follows by 
induction. To prove that, for k > 0, / 3&-1 is odd—call this proposition Q(k )— 
we note that 2(1) is true because f\ = 1 is odd, and if Q(k) is true, then fck-\ is 
odd, and f3(k + 1) — 1 = fok-2 (mod 2) = 1 (mod 2). We have Q(k + 1) and 
again the result follows by induction. The proof for k = 1 (mod 3) is similar. 

Practice Exercise. Prove by induction that the f n is divisible by 3 if and only if 
n is divisible by 4. 

Some further properties of Fibonacci numbers appear among the Exercises. 

Exercises 2.5 

In Exercises 1 to 6 , prove the given proposition by induction. 

1- £”=i r 2 = \n(n + 1)(2 n + 1). 

2. EJU k3 = E=i k f = \ n2 ( n + 1 > 2 - 

3. 1 + 4 + 7 + • • • + (3n - 2) = i«(3n - 1). 

4. 2 + 6 + 12 + • • • + n(n + 1) = £Li *(Jfc + 1) = ^n(n + l)(n + 2). 

5 J._|—1_|—1_L ... a- 1 _ — —H _ 

1-3 ^ 3-5 ^ 5-7 ^ ^ (2n-l)(2n+l) — 2n+l * 

6 . 1 + 3 + 3 2 + • • • + 3" = i(3 " +1 - 1). 

7. Write down the first twelve Fibonacci numbers. 

In Exercises 8 to 11, prove the given result about the Fibonacci numbers, for all 
positive integers n. 

8. f n is divisible by 4 if and only if n is divisible by 6 . 

9. fl + h H-f fn = Zrc +2 — 1 - 

10. fl + /3 + • • • + fln—l = /2n- 

11 . /„ 2 + / n 2 + 1 =/ 2 n+l. 

12. The numbers < 20 , 01 , 02 , ... are defined by <20 = 5 and = 2a n (l—a n ) when 
n > 0. Prove that 
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13 . The numbers ao, a\ , < 22 , ... are defined by = 3 and 

= 2<z n — a n when n > 0. 

Prove that a n = 1 — 2 2 ” when ft > 0, although this formula does not apply when 
n = 0. 

14 . Show by induction that 2 n > n 2 for n > 4. 

In Exercises 15 to 18, prove the divisibility result for all positive integers n. 

15 . 2 divides 3 n — 1. 

16 . 6 divides n 3 — n. 

17 . 5 divides 2 2n ~ l +3 2 ^“ 1 . 

18 . 24 divides n 4 — 6n 3 + 23 n 2 — 18 n. 

19 . Prove that the sum of the cubes of any three consecutive integers is a multiple 
of 9. 

20 . The numbers x\ , X 2 , ... are defined as follows. x\ = 1, X 2 = 1, and if ft > 2 then 

x^+i = + 2x n _i. Prove that x w is divisible by 3 if and only if n is divisible 

by 3. 

21 . Prove by induction: if n people stand in line at a counter, and if the person at the 
front is a woman and the person at the back is a man, then somewhere in the line 
there is a man standing directly behind a woman. 

22 . Assume that the sum of the angles of a triangle is it radians. Prove by induction 
that the sum of the angles of a convex polygon with n sides is ( n — 2) it radians 
when n > 3. 

23 . Consider the set of real numbers: {x : x 2 < 1}. Show that this set has no least 
member. Use this to prove that the real numbers are not well-ordered. 

24 . Let M° be the set of non-negative real numbers, {x : x e M, x > 0}. Is M° 
well-ordered? 

25 . Assuming the principle of mathematical induction, show that the positive inte¬ 
gers are well-ordered. 

26 . Find the errors in the following “proofs”: 

(i) Theorem. All computer programs contain the same number of bugs. 

Proof. If we show that in any set of n programs, all the programs contain 
the same number of bugs, then we have proved the theorem. We proceed by 
induction on n. 

First, let n = 1. Certainly, in a set consisting of one program, all the pro¬ 
grams contain the same number of bugs, so the statement is true for n = 1. 

Now suppose that for every set containing fewer than n programs, all the 
programs in the set contain the same number of bugs, and consider a set D of 
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n programs, D = {p\, p 2 , ..., p n }. Remove the first program and consider 
D\ = {p 2 , p ?>, ..., p n }, a set of n — 1 programs. By the induction hypothesis, 
all the programs p 2 , ..., p n contain the same number of bugs. Now replace 
the first program and remove the last, forming D 2 = {p \, P 2 , • • •, Pn-l }, an¬ 
other set of ft — 1 programs. By the induction hypothesis, all of p \, ..., p n ~\ 
have the same number of bugs. Hence p\ and p n each have the same num¬ 
ber of bugs as the other programs in the set, so all the programs contain the 
same number of bugs. The theorem follows by induction. 

(ii) Theorem. All computer programs contain the same number of bugs. 

Proof. Any program must have a non-negative number of bugs, so the 
possible numbers are {0, 1, ..., N} for some large (but finite) number N. 
Choose any two programs and compare the number of bugs, say r and 5 , 
contained in them. If max{r, 5 } = 0, then r = s = 0. Now suppose that if 
max{r, 5} < ft — 1, then r = s, and consider the case where max {r, s} = n. 
This implies that max{r — 1,5 — 1} = n — 1 and hence r — l = s — lby the 
induction hypothesis. Hence r = s. Since any two programs have the same 
number of bugs, all programs must have the same number of bugs, and the 
theorem is proved. 
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Boolean Algebras and Circuits 


Observe the similarities between the algebra of sets and the algebra of propositions: 
the universal set plays the same role as T, the empty set 0 corresponds to F, U is 
like V, and Hi is like A. This similarity is exploited by defining an object, called 
a Boolean algebra. The definition is done in such a way that the set of all subsets 
of a fixed set can be interpreted as a Boolean algebra, and a well-defined set of 
propositions also forms a Boolean algebra. 

Boolean algebras are defined from a set of axioms. We show how to derive a 
number of properties directly from these axioms, and investigate other ways to prove 
results about Boolean algebras. 

A Boolean algebra can be used to analyze electrical circuits. We shall describe 
how this is done, and introduce the circuits used to perform arithmetic in computers. 


3.1 Boolean Algebra 

Boolean Algebras Defined 

By a binary operation on a set S we mean a way of combining an ordered pair of 
elements of S to give a uniquely defined element of the set. There are many familiar 
examples of binary operations in arithmetic: for example, addition is a binary opera¬ 
tion on Z, or on Z + , or on other number sets. Subtraction is a binary operation on Z, 
but it is not a binary operation on Z + because some pairs do not give a member of the 
set: for example, 1 and 2 are members of Z + , but 1 — 2 is not in Z + . This property 
of belonging to the original set is called closure. Similarly, a unary operation is a 
rule that associates with each member of S some uniquely defined member of S ; for 
example, the multiplicative inverse operation is a unary operation on M*, but not on 
M (because 0 has no inverse). 

Suppose B is a set containing two distinct special elements called 0 and 1: sup¬ 
pose there are defined on B two binary operations, which we shall denote by + 
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and x, and a unary operation, denoted by '. (We write x + y for the result of apply¬ 
ing the operation + to (x, y), and x ' for the result of applying ' to x.) Then we call 
B a Boolean algebra if the following five pairs of laws hold for all members x, y, 
and z in B. 

B1. Commutative Laws : 

(a) x + y = y + x, 

(b) x x y = y x x. 

B2. Associative Laws: 

(a) (x + y) + z = x + (y + z), 

(b) (x x y) x z = x x (y x z). 

B3. Distributive Laws: 

(a) x + (y x z) = (x + y) x (x + z), 

(b) x x (y + z) = (x x y) + (x x z). 

B4. Identity Laws : 

(a) x + 0 = x, 

(b) x x 1 = x. 

B 5. Complement Laws : 

(a) x + x ' — 1, 

(b) x x x ' — 0. 


We shall call these laws the Axioms of Boolean algebra. 

The elements 0 and 1 are usually called the “zero” and “unity” elements of the 
Boolean algebra. Similarly, the operations + and x are often called “sum” and “prod¬ 
uct,” but they are not the same as the ordinary sum and product of numbers. In par¬ 
ticular, notice that the first distributive law, B3(a), is not true in ordinary arithmetic. 
The element x ' is called the complement of x. 

Sample Problem 3.1. Let U be any set. Show that the power set V(U) is a 
Boolean algebra with the roles of 0, 1, +, x, and ' taken by 0, U, U, Pi, and 
respectively. 

Solution. Rules Bl, B2, and B3 are the commutative and associative laws for 
union and intersection and the distributive laws (2.2) and (2.3). The identity laws 
simply state S U 0 = S and SHU = S, and the complement laws were proved 
in Exercises 2.2.13 and 2.2.17. 

Practice Exercise. Let B be the set of all propositions that can be expressed in 
English. Show that B is a Boolean algebra with the roles of 0, 1, +, x, and ' 
taken by F,T,v, A, and respectively. 

We now introduce another important Boolean algebra, the two-element alge¬ 
bra 62 - This algebra is used to analyze electrical circuits. B 2 has two elements, 
called 0 and 1. The operations on B 2 can be described in ordinary arithmetic as 
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follows: x + y equals the greater of x and j,xxj equals the lesser of x and y, and 
x ' — 1 — x. In other words, the operations are given by the tables 


+ 

0 

1 

X 

0 

1 

X 

x' 

0 

0 

1 

0 

0 

0 

0 

1 

1 

1 

1 

1 

0 

1 

1 

0 


For brevity we adopt three notational conventions that are like those used in ordi¬ 
nary arithmetic. First, we say that ' takes precedence over x and x takes precedence 
over +. For example, the expression x x y + z means (x x y) + z, not x x (y + z), 
and x x y' means x x (y r ), not (x x y)'. Second, we omit the x sign when no 
confusion arises, so that xy means the same as x x y. And finally, we omit brackets 
in the expression x + (y + z) and (x + y) + z: the associative law tells us that these 
two expressions are equal, so we simply write x + y + z. We do the same for sums 
of more than three terms, and also for extended products. 

Sample Problem 3.2. Write the following expressions with as few symbols as 

possible, using the conventions: x + ((y x (z r )) + t), x x ((x + (y')) + z). 

Solution, x -\- yz' + t, x(x + y' + z). 

Practice Exercise. Do the same for the expression (x x (y')) x ((z') x x). 


Some Theorems about Boolean Algebras 

Suppose you are given any formula in a Boolean algebra. You could derive another 
formula by interchanging + and x throughout, and also interchanging 0 and 1. This 
new formula is called the dual of the original. For example, the dual of x + y ' = 0 
is x x y ' = 1. 

Observe that each of the axioms B1 through B5 consists of a statement and its 
dual. So, if we can use the axioms to prove any theorem about Boolean algebras, 
then we could also use them to prove the dual of the theorem. We state this result 
formally as follows. 

Principle of Duality. The dual of every theorem in Boolean algebra is also a theo¬ 
rem. 

The effect of this is to halve the amount of work we need to do, in the sense that 
every proof gives both a theorem and its dual theorem. 

We shall list several theorems that hold in any Boolean algebra. Some proofs will 
be given as examples; others are left as exercises. In each case, the theorems come 
in pairs, one statement being the dual of the other. 
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Theorem 11 (Further Distributive Laws). In any Boolean algebra, all elements 
x , y, and z satisfy (a) (x + y)z = xz + yz and (b) xy + z = (x + z) (y + z). 

Theorem 12 (Idempotent Laws). In any Boolean algebra, every element x satisfies 
(a) x + x = x and (b) xx = x. 


Sample Problem 3.3. Prove the idempotent laws. 

Solution. We prove the statement (a) using the axioms. Statement (b) is then 
true because it is the dual of (a). 

x = x + 0 by B4(a) 

= x + xx ' byB5(b) 

= (x+x)(x+x / ) by B3(a) 

= (x + x)l by B5(a) 

= (x + x) by B4(b). 

Practice Exercise. Prove the second idempotent law from the axioms. 

Theorem 13 (Absorption Laws). In any Boolean algebra, all elements x and y sat¬ 
isfy (a) x + xy = x and (b) x x (x + y) = x. 


Theorem 14 (Nullity Laws). In any Boolean algebra, every element x satisfies (a) 
x + 1 = 1 and (b) x x 0 = 0. 

The nullity laws are so named because 0 is a null element for product, just as the 
number 0 is for ordinary multiplication. They are also called boundedness laws. 

Sample Problem 3.4. Prove the boundedness laws from the axioms. 

Solution. Again we prove only one statement using the axioms — in this case 
the statement (b). 


xO = xO + 0 

by B4(a) 

= xO + xx ' 

by B5(b) 

x 

+ 

X 

II 

by B3(b) 

'o 

+ 

X 

II 

by B 1(a) 

= XX ' 

by B4(a) 

= 0 

by B5(b). 


Practice Exercise. Prove the absorption laws from the axioms. 

Although we write the laws in terms of single letters, they apply to any expres¬ 
sions involving elements of a Boolean algebra. 
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Sample Problem 3.5. Prove that zt + ytz = tz . 

Solution. From the commutative laws, zt = tz, and ytz = tzy, so zt + ytz = 
tz + tzy . Define x — tz. From the absorption law, x + xy = x, so tz + tzy = tz. 

Practice Exercise. Prove that yz + yz — yz. 

The complement of a set is defined in a natural way, and obviously no set can 
have two complements. Similarly, the negation of a proposition is a unique proposi¬ 
tion. This uniqueness is actually a property of all Boolean algebras. 

Theorem 15 (Uniqueness of Complement). If x and y are any elements of any 
Boolean algebra, and if x + y = l and xy = 0, then y = x 


o 

+ 

II 

X 

by B4(a) 

= x ’ + x y 

from the data 

= (x’ + x)(x’ + y) 

by B3(a) 

= l(*' + y) 

by B5(a) 

= (.X + y){x' + y) 

from the data 

= 

from the data 

= y + xx ' 

by B3(b) 

— yO 

by B5(a) 

= y 

by B4(b). 


This enables us to prove some further theorems about complements. 

Theorem 16 (Involution Law), (x ')' = x. 

Theorem 17. 0 ' = 1 and 1' = 0. 

Theorem 18 (de Morgan’s Laws). In any Boolean algebra, all elements x and y 

satisfy (a) (x + y) ' = x 'y ' and (b) (xy) ' = x ' + y '. 

Exercises 3.1 

1. Let B be the set of all positive integer divisors of 15, that is, B = {1, 3, 5, 15}. 
Prove that B forms a Boolean algebra with zero element 1 and unity element 15, 
provided the operations are defined as follows: x + y is the least common mul¬ 
tiple of x and y, xy is the greatest common divisor, and x ’ is the ordinary arith¬ 
metical quotient 15/x. 

2. Show that the procedure outlined in Exercise 1 also produces a Boolean algebra 
when 15 is replaced by 
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(i) 30; 


(ii) 42. 


3. Show that the procedure outlined in Exercise 1 does not produce a Boolean al¬ 
gebra when 15 is replaced by 12. 

In Exercises A to 11, give the dual of the Boolean expression. 


4. xy + 1. 

6. yz + zx. 

8. x + y'. 

10. x(xy + z). 


5. xy + z'. 

7. x'yz. 

9. x(y + z')y. 

11. xy yz + zx. 


12. B consists of the eight elements {0,1 , a, b, c, d, e, f] with the following opera¬ 
tions. 


+ 

0 

a 

b 

c 

d 

e 

/ i 

X 

0 

a 

b 

c 

d 

e 

f 

1 

X 

x’ 

0 

0 

a 

b 

c 

d 

e 

/ i 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

a 

a 

a 

c 

c 

e 

e 

i i 

a 

0 

a 

0 

a 

0 

a 

0 

a 

a 

/ 

b 

b 

c 

b 

c 

f 

1 

/ i 

b 

0 

0 

b 

b 

0 

0 

b 

b 

b 

e 

c 

c 

c 

c 

c 

1 

1 

i i 

c 

0 

a 

b 

c 

0 

a 

b 

c 

c 

d 

d 

d 

e 

f 

1 

d 

e 

/ i 

d 

0 

0 

0 

0 

d 

d 

d 

d 

d 

c 

e 

e 

e 

1 

1 

e 

e 

i i 

e 

0 

a 

0 

a 

d 

e 

d 

e 

e 

b 

f 

f 

1 

/ 

1 

f 

1 

/ i 

f 

0 

0 

b 

b 

d 

d 

f 

f 

f 

a 

1 

1 

1 

1 

1 

1 

1 

i i 

1 

0 

a 

b 

c 

d 

e 

f 

1 

1 

0 


Prove that B is a Boolean algebra. 

13. Use the axioms to prove the third and fourth distributive laws (Theorem 11). 

14. Use the axioms to prove de Morgan’s Laws (Theorem 18). 

In Exercises 15 to 18, prove that the results hold in any Boolean algebra. 

15. x + x'y = x + y for any x and y. 

16. x + xy + y = (x + y)(y + 1) for any x and y. 

17. (x + y')z' = ((.x' + z)(y + z)) 7 for any x, y andz. 

18. x + y(x + z) = (x + y)(x + z) for any x, y and z. 

19. Write down the duals of the equations in Exercises 15 to 18. 

20. Prove that, in any Boolean algebra, x + y+x'y'z = x + y + z, for any x, y, 
and z. 

21. Suppose the elements x, y, and z in a Boolean algebra satisfy x = yz' + y'z. 
Prove that x' = yz + y'z'. 
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22. In ordinary arithmetic, the cancellation law for addition is the law that says the 
following: if the integers x, y, and z satisfy x + y = x + z, then y = z. Prove that 
there is no cancellation law for = or for x in Boolean algebras. (It is sufficient 
to show that, in any Boolean algebra, there are elements x, y, and z such that 
x + y = x + z, but y ^ z.) 

23. Show that in any Boolean algebra, if the elements x, y, and z satisfy x+y = x+z 
and x ' + y — x' + z, then y = z- (Compare this with the preceding exercise.) 

24. Define a structure B as follows. The elements of B are the integers Z. However, 
x + y means the greater of x and y, and xy means the lesser of x and y. 

(i) Show that B satisfies the associative, commutative, and distributive laws 
Bl, B2, and B3. 

(ii) Show that, no matter which element of Z is chosen to be 0, the identity law 
x + 0 = x does not hold. 

(iii) Is this true if the elements of B are taken to be the nonnegative integers? 

25. Suppose B is any Boolean algebra. Define A to be the set B x B, consisting of 
all the ordered pairs (x, y) with x and y in B. Define operations sum, product, 
and complement on A by 

(x, y) + (z, t) = (xy, z + t), 

(• x,y)(z,t) = (x+y,zt), 

(x,y)' = (x',y'). 

Prove that A is a Boolean algebra. What are its zero and unity? 


3.2 Boolean Forms 

Disjunctive Forms 

By a Boolean expression in the variables {x, y, ...} we mean any expression built up 
from the variables using the variables, the operators +, x, and \ and parentheses. 
Examples of Boolean expressions in {x, y, z} include x, y', xy + yzx\ and (x + 
yz)(x ' + yz). In particular, Boolean expressions consisting of a single variable or its 
complement, like x and y', are called literals. 

Sample Problem 3.6. Consider the Boolean expression a(x,y,z) = xyz' + 
x(y + z) + (x + z)'. Find the values of a(l, 1, 1), a(l, 0, 0), and a(0, 1, 1). 

Solution. a(l, 1, 1) = 111 / +1(1 + 1) + (1 + 1) / = ll / +10+0 / = 0+0+1 = 1. 
a(l, 0, 0) = 100 r + 1(0 + 0) + (1 + 0) 7 = 0 + 10 + 1 7 = 0 + 0 + 0 = 0. 
a(0, 1,1) = 011 / + 0(l + l) + (l + l) / = 0 + 0 + 0 / = 0 + 0+l = 1. 
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Practice Exercise. If /3(x, y, z) = xy + yzx evaluate /3( 1, 0, 1), /3(1, 1, 0), 
and /3(0, 1, 1). 

Intuition suggests that a Boolean expression is less complicated when it involves 
fewer symbols. For this reason, we prefer forms in which the number of repeated 
symbols is smaller. We define a fundamental product to be a literal or a product of 
literals that contains no repeated variable. The products x, xy, x 'zy, and y' are fun¬ 
damental products, but xyx ' and yxyz are not. Any product is equal to a fundamental 
product or to 0: if it contains two copies of the same literal x, we can use the idem- 
potence law (Theorem 12(b)) to replace xx by x, and if it contains both x and x' we 
can use the complement law (Axiom B5(b)) to replace their product, and in fact the 
whole product, by 0. 

Sample Problem 3.7. Reduce the following products to a fundamental product 
or to 0: zxyxz, x 'yyx ’. 

Solution, zxyxz = xxyyz (repeated use of the commutative law) = xyz (two 
uses of the idempotent law), x 'yyx ' = x’x’yy (commutative law) = Oy = 0 
(idempotent law, nullity law). 

Practice Exercise. Reduce the following products to a fundamental product or 

0: xy'x f z f ,x f yz f y. 

If a and ft are two fundamental products, we say that f includes a and write 
a < f if every literal in a is also a literal in f. A literal and its complement are 
treated as different. So xz < xyz is true, but xz' < xyz is false. If a < f>, then 
a + f> — a, by the absorption law: for example, xz + xyz = xz + (xz)y = xz. 

We say that a Boolean expression is a disjunctive form if it equals a sum of 
fundamental products and none of the products includes any other product. For ex¬ 
ample, xy + xz + yz is in disjunctive form, but xy + xz + xyz is not. However, 
xy + xz + xyz = xy + xz, so xy + xz + xyz equals a disjunctive form. The funda¬ 
mental products are called the terms of the form. (This is just the way “term” is used 
in ordinary arithmetic.) 

Theorem 19. Any Boolean expression is equal to a disjunctive form. 

Proof. We present an algorithm, in four steps, to reduce any normal form to a dis¬ 
junctive form. 

Step 1. By repeated application of de Morgan’s laws, move all complement signs 
inside parentheses. Then use the involution law to reduce any strings of comple¬ 
mentation to one or no complements (x w = x', x nn — x, and so on.) Then the 
form will be comprised entirely of literals, parentheses, and + and x operations. 


3.2 Boolean Forms 


75 


Step 2. We may assume that the form is something like a + /3 + • • •, where each 
of a, P, ..., is a product. If any of these terms is a product of literals, leave it 
unchanged. Otherwise, if (say) a is a product in which one or more of the terms 
is a product of sums, transform it into a sum of products using the distributive 
law B3(a). 

Step 3. Use the commutative, idempotent, and complement laws to transform each 
product into 0 or a fundamental product, as we did in Sample Problem 3.7. 

Step 4. Any zero term can be deleted. If any term is included in any other, use the 
absorption law to delete one term. □ 

Sample Problem 3.8. Illustrate Theorem 19 by reducing {xy') '{x +x 'z)' to dis¬ 
junctive form. 

Solution. Step 1: de Morgan’s laws give {xy')' = x' + y" = x r + y and 
{x -\-x'z)' = x'{x'z)' = x '{x " + z 0 = x '{x + z 0- So the original form equals 
(x' + y)(*'(*+*')). 

Step 2: (x' + y)(x f (x + z')) = (*' + y){x'x + x'z') = {x' + y)x'x + 
{x' + y)x 'z' = x'x'x-\-yx'x-\-x'x'z'-\-yx' z'. 

Step 3: x'x'x + yx'x + x'x'z' + yx'z' — x'x yx'x + x'z' + yx'z' = 
0 + y0 + * V + yx V = x'z' + yx'z'. 

Step 4 : x'z' + yx'z' = x'z' + x'z'y = x'z'. 

Practice Exercise. Illustrate Theorem 19 by reducing (v 'y) '(x' + vyz ') to dis¬ 
junctive form. 


Minimal Forms and Prime Implicants 

Suppose a and ft are two disjunctive forms. We shall say that a is simpler than 
f if either a contains fewer terms than P and contains no more literals than p , or 
a contains fewer literals than P and contains no more terms than p. We say the 
disjunctive form a is minimal if no disjunctive form equal to a is simpler than a. 

A fundamental product n is called a prime implicant for a Boolean expression a 
if n + a = a and if no fundamental product included in n has this property. 

To show that jx is a prime implicant for a Boolean expression a involves two 
kinds of proof. We usually prove that n + a = a by manipulating the formula for 
7T + a. For every fundamental product a included in n we are then required to prove 
that a + a / a. This is normally done by showing that there is some assignment of 
values to the variables that gives a + a and a different values. 

Sample Problem 3.9. Show that xz is a prime implicant of a = xy + vy 'z + 
x'yz. 
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Solution. 

xz -\- a = xz + xy + xy'z+ x 'yz 

= x(y + y f )z + xy + xy f z + x f yz 
= xyz + xy 'z + xy + xy 'z + * 'yz 
= (xy + xyz) + (xy 'z + xy 'z ) + x 'yz 
= xy + xy'z + x 'yz 
= a. 

The only fundamental products included in xz are x and z. To show that x + a / 
o', it is sufficient to observe that if x = 1 and y = z = 0, we have x + a = l, a = 
0. For the form z, the substitution z = l,x = y = 0 gives z + ex = 1, a = 0. 

Practice Exercise. Show that xy is a prime implicant of /3 = xyz + xy 'z + 

xyz' + x'y 'z'. 

The importance of prime implicants is explained by the following theorem. 

Theorem 20. If a is a minimal disjunctive form, then the terms of a are prime impli¬ 
cants of a. 

Proof. Suppose a is a minimal disjunctive form. We show that every term of a is a 
prime implicant. Let tt be such a term. Then we can write 

a = tv + y, 

where y is the sum of the other terms of a. Then 

7r+of = 7r+7r + }/=7r + ]/=Qf. 

On the other hand, if jt included any product f that satisfied f + a = a, then 
a = y 0+a = (j0 + 7r) + }/=y0 + }/. 

This new form is simpler than the original since it contains as many terms as the 
original and contains fewer literals (f contains fewer literals than Tt). But this is 
impossible as a is minimal. □ 

It follows that a minimal disjunctive form is the sum of its prime implicants, so 
we can construct the minimal disjunctive form of a Boolean expression by finding its 
prime implicants. For this reason we shall present (in the next section) a technique 
for finding all prime implicants of a Boolean expression. 


The Disjunctive Normal Form 

A Boolean expression in the set of variables S is said to be in disjunctive normal 
form (or complete) if it is in disjunctive form and every member of S appears in 
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every term. For example, x'y + xy' is in disjunctive normal form over {x, y} but 
x'y + x is not. 

The set S is important in this definition. If S is {x, y, z}, then xy -\- x'y' is not 
complete over S. To avoid the need to specify S every time, we usually assume that 
S consists of precisely the variables appearing in the expression, so that if we write 
xy -\- x'y' we assume that S = {x, y} unless told otherwise. 

Every Boolean expression can be put into disjunctive normal form using the com¬ 
plement law. For example, v = xl = x(y + y') = xy + xy', so x'y + x = 
y + xy + xy', which is complete. 

Sample Problem 3.10. Put into disjunctive normal form xyz + xz', xy + xz. 

Solution, vyz + xz' = vyz + x(y + y')z' = vyz + xyz' + xy'z', xy -\- xz = 
xy(z + zf) + xfy + y/)z = vyz + xyzt + vyz + xytz. 

Practice Exercise. Put xy'-\-xz'-\-yz into disjunctive normal form. 

Disjunctive normal forms are useful because two disjunctive normal forms in the 
same set of variables are equal if and only if they contain precisely the same terms. 
Surprisingly enough, we shall also use them in the next section as a tool for finding 
prime implicants, and consequently, minimal forms. 

Exercises 3.2 

1. The Boolean expression a(x, y) = (xy + xy ')(x' + y). Evaluate 

(i) a(0,0); (iii) a(l,0); 

(ii) a(0, 1); (iv) a(l, 1). 

2. The Boolean expression f(x, y) = (x + y + xy)(x + y'). Evaluate 

(i) 0(0,0); (iii) 0(1,0); 

(ii) 0(0, 1); (iv) 0(1, 1). 

3. The Boolean expression y(x,y,z) = (x + y + z)(x' + y' + z'). Evaluate 

(i) y(0,0,1); (iii) y(1,1,0); 

(ii) y(0,1,0); (iv) y(1,1,1). 

4. The Boolean expression <$(jc, y, z) = (jc + y f z)(y + z')- Evaluate 

(i) 5(1, 0,0); (iii) 5(1, 1,0); 

(ii) 5(0, 0,1); (iv) 5(1, 1,1). 

In Exercises 5 to 20, the Boolean expressions a, /3, y, and 8 are defined as in Exer¬ 
cises 1 to 4. Evaluate the given expression over the eight-element Boolean algebra 
B defined in Exercise 3.1.12. 
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5. a(l, e). 

6. a(a, b). 

7. a(f, 1). 

8. P(c,e). 

9. 

10. P(b,a). 

11. a(e, l)fi(a, c). 

12. a(a,b)+/3(e,f). 

13. y (1, e , a). 

14. y (a, b, e). 

15. y (1, /, 0). 

16. S(e,c, 1). 

17. 8(f,d,d). 

18. 5(1,/?, a). 

19. y(e, a, l)8(b, b, c). 

20. y (0, a, /?) + 8(e, d, 1). 


In Exercises 21 to 24, reduce each product to a fundamental product or zero. 
21. xyy'xy. 22. xyxzyzx. 

23. xy'zx. 24. yyxyyz . 

In Exercises 25 to 32, reduce the Boolean expression to disjunctive form. 

25. x(xy'+x'y). 26. *(*' + /)'. 


27. (x + ;y)(x'+ z)(j + z). 

28. (x + ^'Cx'y)'. 

29. (x + y)(x + z)(y + z). 

30. (x + y)xyz. 

31. (x + y)(x' + y'). 

32. (x + yKx + zOCy' + z)- 


In Exercises 33 to 38, find a simpler form for the Boolean expression. 
33. x+x 7 y. 34. xy 7 x 7 + xyx 


35. yx + yz' + y'xz + z. 

36. x + yz + x'y + y’xx. 

37. xy V + xyz 7 . 

38. xy —|— x —|— y. 


In each of Exercises 39 to 42, A 7T a prime implicant of a? 

39. 7Z = xz', a = xy' + xyz' x'yz'• 

40. 7t = xya = xy + xy'z' + x'y'z . 

41. 7T = xy, a = z + xyz + xyz 7 . 

42. t r = xyz, a = x 7 yz + xy 'z + xyz. 

In Exercises 43 to 46, wr/to /7?<? Boolean expression in disjunctive normal form. 
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43. (x' + y)' + yz'. 44. (x + y + z)(x' + y + z). 

45. (x + y’ + z)(x' + y + z). 46. y(x + yz'). 


3.3 Finding Minimal Disjunctive Forms 

Karnaugh Maps 

In this section we outline a technique to find a minimal disjunctive form for any 
Boolean expression. We first convert the expression into a disjunctive form. If we 
find all prime implicants of this form, then the expression is equal to the sum of 
those prime implicants. 

Interestingly, the method we use to find prime implicants operates on a disjunc¬ 
tive normal form. Paradoxical as it may seem, the first step in finding the simplest 
form is to convert to a form that will usually be less simple. 

The Karnaugh map of a disjunctive normal form is a diagram in which the funda¬ 
mental products are represented by small square cells. It is something like a stylized 
map of a country, where the cells represent states. We shall call two cells adjacent if 
they have a common border line (not just a single point). So two cells are adjacent if 
they differ only because one of the literals is complemented in one case and not com¬ 
plemented in the other. By looking for these adjacent pairs of cells you can reduce 
the number of terms in the form: for example, if both xyz and xyz' are represented, 
then the form contains xyz + xyz', which can be replaced by xy. Karnaugh maps 
are useful in expressions involving as many as four variables. 


Two-Variable Karnaugh Maps 

Two-variable Karnaugh maps are not particularly useful, but the ideas we see here 
will be used in three-variable and four-variable cases. 

A two-variable Karnaugh map is drawn in a 2 x 2 array of squares. The two 
horizontals, or rows , represent one variable, while the verticals, or columns , represent 
the other. We shall assume the variables are x and y, and associate x with rows and 
y with columns; the first (upper) row corresponds to x and the second to x ', while 
the first (left) column corresponds to y and the second to y Each cell represents the 
corresponding fundamental product. So the correspondences are as follows. 


x 


xy 

xy ' 


x'y' 
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For example, the form xy + x'y' corresponds to a map with the upper left and 
lower right cells. For our purposes, rather than deleting the missing cells, it will be 
more convenient to draw the whole diagram, with the required cells occupied by a 1: 

y y' y y' 


X 


is represented by 


1 



1 


Sample Problem 3.11. Draw Karnaugh maps for xy -\- x'y and x'y-\-x'y'. 

Solution. 


y y' y y' 


Practice Exercise. Draw Karnaugh maps for xy and x'y + xy'. 

Once the Karnaugh map is drawn, it is easy to simplify the expression. Look for 
pairs of adjacent occupied cells. Each pair corresponds to a sum (such as xy + x 'y) 
that can be reduced to a simpler single term (y in the example). 

If the whole 2x2 rectangle contains all 1 ’s, then it will be called a basic rectangle. 
Each adjacent pair that both contain l’s will also be called basic, unless the whole 
rectangle is basic, and any cell with a 1 that has no 1 adjacent to it is also called basic. 
In other words, a rectangular area in which all cells contain 1 will be called basic 
unless it is contained in some larger rectangle of all l’s. Then each prime implicant 
is the sum of the elements in a basic rectangle. For example, in Sample Problem 3.11, 
the basic rectangle joining x 'y and x'y' gives prime implicant x'y + x'y' = x'. 

To find a minimal form of a Boolean expression, first put the expression in dis¬ 
junctive normal form and draw the Karnaugh map. Identify the basic rectangles. 
Among the basic rectangles, find a set that between them covers every 1, with no 
unnecessary rectangles; this is called a minimal cover. The minimal form is the sum 
of the basic rectangles in a minimal cover. 

The “no unnecessary rectangles” condition is not important in the two-variable 
case, but we shall return to it later. In cases with more variables, there can possibly 
be more than one minimal form. 

Sample Problem 3.12. Use Karnaugh maps to find minimal forms for xy + x 'y, 

xy + x'y + xy ', and xy + x'y + xy' + x'y'. 


y y 

7 1 


*'IU I U 


Solution. The maps for the three forms are 

y y’ y y’ 


PI 

1 ) 

Iff 




rn 




Lu 




X 

1 


X 



x' 

1 


x' 

1 

1 
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respectively. The basic rectangles are marked. (Notice that two rectangles overlap 
in the second diagram.) These give the expressions xy + x'y = y, xy + x 'y + 
xy' = x + y , and xy + x'y+xy' + x'y r = 1. 

Practice Exercise. Find minimal forms for xy x'y' and xy + xy'. 

In the diagram for xy + x'y + xy r we did not circle the single 1 corresponding 
to xy ' because that lxl rectangle is enclosed in a 2 x 2 basic rectangle. 


Three-Variable and Four-Variable Maps 

Three-variable maps use the eight-cell pattern 

yz yz' y'z' y'z 
x 
x' 


They are analyzed similarly to two-variable maps, with the additional convention 
that cells in the first and last columns are treated as adjacent if they are in the same 
row (“rows wrap around”). Because of the larger size, larger basic rectangles are 
possible: sizes 1x4 and 2x4 can occur. But 1x3 rectangles are not considered basic 
because they do not lead to simplification. A 1 x 3 rectangle gives two overlapping 
1x2 rectangles. For example, the map for xyz + xyz ' -\- xy'z' is 


x 

x' 


yz yz' y'z y'z 


(1 

Cl) 

1) 







and the minimal form is xy + xz 

Four-variable maps use the 16-cell pattern 



The wrap-around property is applied to columns as well as rows, and 4 x 1,4x2, 
and 4x4 basic rectangles are possible. 

Sample Problem 3.13. Use Karnaugh maps to find minimal forms for xyz + 
x'yz'+ xyz'+ x'y'z, xyz + xy'z + xy'z' + x'yz + x'y'z, xyz + xyz'+ 
xy'z' + x'y'z + x'y'z'. 
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Solution. The maps for the first two forms are 


yz 

yz' 

y'z' 

y'z 


yz 

yz' 

y'z' 

y'z 

(i 

[i| 



X 

\ 

1 


(i 

[0 


vL 


0 

x' 




li 


respectively. The basic rectangles are marked. Notice that two 1x2 rectangles 
and two 2x1 rectangles are not marked in the second diagram. They are not basic 
because they are in the 2 x 2 rectangle. In the first case the prime implicants are 
xyz + xyz ' = xy , xyz' + x'yz' — yz' and x'y'z . So the minimal form is 

xyz + x 'yz' + xyz ' + x' y' z — xy + yz' + x' y' z. 

In the second case the prime implicants are xy 'z + xy'z' — xy' and (xyz + 
xy 'z) + (x 'yz + x'y'z) = xz + x'z = z and the minimal form is 

xyz + xy'z + xy'z' + x'yz + x'y'z = xy' + z. 

In the third example we obtain 


x 

x' 


yz yz y z y z 


(i 

CO 

0 




uJ 

i) 


Not all of these basic rectangles are necessary. We can omit either the rectangle 
joining xyz ' to xy 'z', or the one joining xy ' z' to x 'y 'z ', and all l’s will still be 
covered. So we can use either of the diagrams 


x 

x' 


yz yz y z y z 


: i 

CD 

i) 




0 

l) 


yz yz y z y z 


(1 

1) 

fl) 




(lj 

1) 


and obtain two different minimal forms: 

xyz + xyz ' + xy' z + x' y' z + x' y' z' = xy + y' z' + x' y ', 
xyz + xyz 7 + xy'z' + x'y'z + x'y'z' = xy + xz' + x'y '. 

Sample Problem 3.14. Use Karnaugh maps to find minimal forms for a = 
xyz 't + xyz 't' + xy 'zt + xy 'zt' + x 'yz't + x'yz't' + x 'y 'zt + x 'y 'zt' and 
/3 = xyzt+xyz't+xy'zt+xy'z't+xyzt' +xyz't' +xy'zt'+xy'z't'+x'yz't'. 


Solution. The maps for the two forms are 
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zt zt' z’t’ z't 
xy 
xy' 

xy 

x'y 




i 

iJ 

fl 

1 N 




[l 

K 






fi 

n 


zt zt' z't' z't 
xy 
xy' 

x'y ' 
x'y 



fi 

1 

1 

r 



a 

1 


(T 


i. 













respectively. The basic rectangles are marked. We get a — yz' + y 'z and fi = 
x + y'z't'. 

Sometimes there can be a large number of basic rectangles, and care must be 
taken to find a minimal cover. 


Sample Problem 3.15. Use Karnaugh maps to find a minimal form for xyzt + 
xyzt r +xyz f t r +xy 'z f t r +x 'y r zt+x 'y'zt r +x 'y 'z 't '+x f y 'z 't+x 'yzt +x 'yzt’. 


Solution. The map is shown in the left-hand part of the following diagram, and 
six basic rectangles are marked. However, on examination we see that three basic 
rectangles cover all the l’s, as shown in the right-hand figure. 


zt zt' z't' z't 
xy 
xy' 
x'y' 
xy' 


l 1 

(iJ 


(i)) 





m 


M 

o 


jL 

i) 

y_ 

A 




zt zt' z't' z't 
xy 
xy' 
x'y' 
xy' 


11 

iJ 


T 







(i 

1 

1 

i) 

i 





So the minimal form is xz't' + x'y' -\- yz. 


Exercises 3.3 

Use Karnaugh maps to find minimal forms for the two-variable expressions in 
Exercises l to 6. 

1. xy-\-xy'. 2. xy' -\- x'y -\- x'y'. 

3. xy-\-x'y-\-x'y'. 4. x'y-\-xy'. 

5. xy' -\-x'y'. 6. xy-\-xy'-\-x'y. 

7. What is the fundamental product represented by each of the following Karnaugh 
maps? 



yz 1 

y’z’ 

y'z 

(ii) 

yz 

yz 1 

y'z' 

y'z 


(i 

o 


X 

0 



a 





x' 






x 
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8. What is the fundamental product represented by each of the following Karnaugh 
maps? 


(i) yz yz' y f z' y'z 


0 



1 

ij 



u 


(ii) 


yz yz' y'z' y'z 



fl 

n 



u 

ij 



Use Karnaugh maps to find minimal forms for the three-variable expressions in Ex¬ 
ercises 9 to 20. 

9. xyz + xyz r +xy f z. 

10. xyz + xyz ’ + xy'z' + x’y’z. 

11 . xyz' -\-xy'z-\-xy'z' -\-x'yz-\-x'yz' -\-x'y r z. 

12. xyz' + xy'z' + x'yz + x'y' z. 

13. xyz + xyz f + x'yz + x'y 'z. 

14. xyz ' +xy'z + x'yz + x'yz'- 

15. xyz + xyz' + x'yz'. 

16. xyz ' + xy'z + xy'z' + x'yz' + x'y'z. 

17. xyz + xyz ' + xy'z + xy'z' + x'y'z. 

18. xyz + xyz' + xy'z + xy'z' + x'yz + x'yz' + x'y'z + x'y'z '. 

19. xyz ' + xy'z + xy'z' + x'yz + x'y' z. 

20. xy'z + xy'z' + x'yz + x'y'z'. 

21. What is the fundamental product represented by each of the following Karnaugh 
maps? 

(i) zt zt ' z’t’ z't (ii) 

xy 
xy' 
x'y' 
x'y. 


xy 

xy' 

x'y' 

x'y 








(i 

0 










zt zt' z't' z't 


n 



fi 

1 



1 

1 



1 

u 



u 


22. What is the fundamental product represented by each of the following Karnaugh 
maps? 
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(ii) zt zt' z't' z't 
xy 
xy' 

xY 

x'y 




fl 

f 





1 

1 





1 

1 





d 

l 




Use Karnaugh maps to find minimal forms for the four-variable expressions in Ex¬ 
ercises 23 to 30. (There will sometimes be more than one correct answer .) 

23. xyzt + xyz 't + xyzt' + x 'yzt + x 'y 'zt + x 'yzt '. 

24. xyz f t + xy 'zt + xy'z't + x'yzt + x'yz't + x'yzt' -\- x'y'z't'. 

25. xy'z't +xy'zt' + xy'z't' + x'y'zt + x'y'z't +x'y'zt' + x'y'z't'. 

26. xyz't' +xy'zt' + xy'z't' + x'yzt +x'y'zt +x'yzt' +x'yz't'. 

27. xyzt + xy'z't + xyzt' + xy'z't' -\-x'yzt +x'yz't -\-x'yz't' -\-x'y'zt'. 

28. xy'z't +x'y'zt + x'y'z't +x'yzt' + x'yz't' +x’y'zt'. 

29. x 'yzt + x 'yz't + x 'y 'zt -\-x'y'z't-\-x 'yzt' -\-x'yz't' -\-x'y'zt' x'y'z't'. 

30. xyz 't + xyzt' + xy'zt' + xy'z't' + x 'yzt' -{-x'yz't'. 

31. In each case, write down the disjunctive form of the Boolean expression repre¬ 
sented by the given Karnaugh map, and write down a minimal form for it. 



zt 

zt' 

z't' 

z't 

(ii) 

zt 

zt' 

z't' 

z't 

xy 




1 

xy 



1 

1 

xy' 


1 

1 


xy' 

1 

1 

1 

1 


1 

1 





1 

1 

1 

x'y 

1 




x'y 



1 



32. In each case, write down the disjunctive form of the Boolean expression repre¬ 
sented by the given Karnaugh map, and write down a minimal form for it. 


(i) 

zt 

Zt' 

z't' 

z't 

(ii) 

zt 

zt' 

z't' 

z't 

xy 


1 



xy 


1 



xy' 


1 

1 

1 

xy' 

1 

1 

1 


x'y' 

1 

1 

1 


x'y' 


1 

1 

1 

■Vv 

1 

1 

1 


x'y 



1 



33. In each case, write down the disjunctive form of the Boolean expression repre¬ 
sented by the given Karnaugh map, and write down a minimal form for it. 
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zt 

zt' 

z't' 

z't 

(ii) 

zt 

zt' 

z't' 

z't 

xy 



1 

1 

xy 

1 

1 



xy' 

1 

1 



xy 

1 

1 

1 

1 

x'y ' 

1 

1 



x'y' 



1 

1 




1 


x'y 

1 

1 




34. In each case, write down the disjunctive form of the Boolean expression repre¬ 
sented by the given Karnaugh map, and write down a minimal form for it. 


(i) 

zt 

zt' 

z't' 

z't 

(ii) 

zt 

zt' 

z't' 

z't 

xy 

1 



1 

xy 



1 

1 

xy' 


1 

1 


xy' 

1 

1 

1 

1 

x'y' 



1 


x'y' 

1 



1 

x'y 

1 



1 

x'y 

1 


1 



In Exercises 35 to 40, convert the Boolean expression to a disjunctive normal form 
and find a minimal form. 

35. xyz + xyz ' + x 'z' + x r y 'z + x 'yz 36. xy' + xyz' + x r yz '. 

37. (x + y)'(z'+ *')'. 38. X y + x’(y f + z f )+ yz’. 

39. v '(yz + zt + ty). 40. xyz't + xy 'zt' + xyt' + zt f + xz 


3.4 Digital Circuits 

Doing Arithmetic with Circuits 

Computers use electricity, and as such they are essentially binary, either a switch is 
open or it is closed; either there is current in a wire or there is not. As we pointed out 
in Section 1.5, this means that computers use binary arithmetic. So it is necessary to 
build electric circuits that carry out pieces of arithmetic. 

The typical circuit consists of a number of inputs, a processor, and a number of 
outputs. For example, suppose you have a circuit that carries out addition. Specifi¬ 
cally, suppose it takes two variables x and y, each of which can equal 0 or 1, and 
outputs the value z = x + y. As z could have two binary digits, we shall write it 
as t\to, the usual binary representation of the decimal number 2t\ + to. Now all the 
quantities involved have value 0 or 1, so each can be represented by a single electrical 
entity, such as a wire (carrying current or not) or a switch (open or closed). 

We represent each variable as a line; you may wish to think of it as a physical 
wire. Then the addition process looks like 
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y 



xy 



x 




AND gate 


OR gate 


NOT gate 


Fig. 3.1. The three basic gates 


* - - 

y —_— *i 

The x and y wires either contain current (variable = 1) or not. The current in the to 
and t\ wires will depend on the outcome of the addition. The question is, what goes 
on in the gray box? 

Suppose v, y, to, and t\ are treated as Boolean variables. Then t\ will be 1 if both 
v and y are 1 and 0 otherwise. So t\ = xy. Similarly, to will belifx = l,y = 0 
or v = 0, y = 1 and 0 in the other cases. After a little thought we see that to = 
x'y xy'. So the arithmetical processes can be represented by Boolean operations. 
If we use the Boolean algebra £> 2 , defined in Section 3.1 then no values other than 0 
and 1 ever appear. 


The Building Blocks 

The circuits we consider are called digital circuits because they deal only with inputs 
that are single binary digits. They are also called logic circuits. 

The simplest component circuits are called gates. All digital circuits can be built 
up using three fundamental building blocks, the AND, OR, and NOT gates. The 
standard symbols for these gates are shown in Figure 3.1. 

The AND gate takes as its input two variables, x and y, and outputs the value xy. 
This is easily realized physically by a series circuit. The AND gate has input x , y 
and output x + y, while the NOT gate (or inverter) has input x and output x '. 

Larger circuits are formed by putting these components together. For example, 
the following circuits produce x 'y and xy 



These can be combined as follows to give a circuit to produce x'y + xy'. 
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x'y+xy' 


Observe the use of a dot ( splitter ) to show that two lines carry the same charge, and 
note that we allow two lines to cross without interacting (just as two insulated wires 
could cross, one on top of the other). 

Sample Problem 3.16. Construct a circuit to implement xy + z. 

Solution. An appropriate circuit is 



Practice Exercise. Construct a circuit to implement x' + yz. 


Half Adders 


We continue with the example of addition. A circuit that carries out the addition of 
two one-bit variables is called a half adder. Following from what we have done so 
far, it is easy to construct a half adder: take the circuit for x'y + xy' and attach to it 
a circuit to produce xy 



x'y+xy' 


xy 


However, a simpler version is available. The given half adder involves six gates 
(three AND, one OR, and two NOT gates). But 

x 'y + xy' = x 'y + xx' + xy' (as xx ' — 0) 

= x'y +x(x' + y') 

= x{x' + y') + yx' 


(reordering) 
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= x (x' + y ') + y (x' + y') (adding 0 again) 

= (.X + y){x' + y') 

= (x + y)(xy )' (de Morgan’s laws). 

This expression is simpler to evaluate. First we find (x + y) (one OR gate) and (xy )' 
(an AND and a NOT). Then we find their product (another AND). Then we are 
finished because the product xy was already computed earlier in the process. So the 
number of gates is reduced to four. The corresponding circuit is 



xy 

x'y+xy' 


Full Adders 

Of course, addition in a computer involves summing numbers with more than one 
binary digit. The way we do addition by hand is as follows: add the units, write down 
the unit digit, move the carry digit to the tens column; add the tens (there are now 
three figures to add), write down the new tens digit, move the carry to the hundreds 
column; and so on. Computers add in the same way. So we need a circuit to add three 
binary digits. Such a circuit is called a full adder. 

A full adder could be constructed from half adders, but the circuit is cumbersome. 
Suppose we want to find x+y+z. We could find x+y, getting the binary number t\ to 
(this takes one half adder, or four gates), then find z + to, getting say w\wo (another 
half adder), and finally add t\ to w\. This latter method is a little simpler because t\ 
and w\ cannot both be 1, but still require an OR gate. So nine gates are required. 

There is a simpler way to find x + y + z in binary notation. The units digit will 
be 1 if exactly one of x, y, and z equals 1, or if all three are 1. The “two’s” digit will 
be 1 when two or three of the summands is 1. So the units digit is 

to = x r y 'z + x 'yz' + xy 'z' + xyz, 


and the two’s digit is 

t\ — x 'yz + xy 'z + xyz ' + xyz. 

These two expressions can be simplified as follows. First we observe that 
to = x\y f z + yz') +x(yV + yz), 
and 

t\ = yz(x +x r ) +x(y'z + yz') = yz + x(y'z + yz'). 
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The expression a = y 'z + yz ' appears in both equations. Moreover, it is not hard to 
see that a ' — y 'z ' + yz. So 

to=x'a+xa\ t\=yz + xa'. 


So we first form the sum of y and z with a half adder. The units term of this is a , 
and the two’s term is yz. If we now add this to x, using a half adder, the unit’s term 
is to and the two’s term is xa. Finally, t\ is produced by putting this value xa and the 
value yz through an AND gate. The circuit is 



units 

twos 


where the square is shorthand for a half adder whose units output is marked by an 
asterisk. This involves a total of nine gates. 

A computer can carry out extended addition by the repeated use of full adders. 


General Circuits 

In general, it is more desirable to build a circuit with fewer gates. To implement any 
Boolean expression, a first step is to replace the expression by its minimal form. 
Then, if possible, make any other transformation to reduce the number of gates— 
usually by using de Morgan’s laws to reduce the number of NOT gates. 

Sample Problem 3.17. Find a circuit to implement xyzt + xyz 't + xy 'zt + 
xy'z't +xyzt' +xyz't' +xy'zt' +xy'z't' +x'yz't'. 

Solution. In Sample Problem 3.14 we saw that the given expression has minimal 
form x -\- y'z't' . To cut down the number of gates, we note that y 'z 't ' — (x + 
y + z)So a suitable circuit is 

y 

z 

t 



Practice Exercise. Find a circuit to implement 


xyz 't + xyz r t' + xy 'zt -\- xy' zt' x 'yz 't-\-x'yz't'-\-x 'y 'zt + v 


'y'zt' 


(see Sample Problem 3.14 again). 
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Exercises 3.4 

1. Verify that if a = y 'z + yz / , then a' = y 'z ' + yz. 

In Exercises 2 to 8, draw circuits to implement the given expression. 

2. x + y + z. 3. xyz. 

4. xyz'. 5. x + yz'. 

6. xy + z'. 7. (x + yz)' + y. 

8. (xy)' + (x + z)'. 

In Exercises 9 to 12, draw a circuit to implement the given expression exactly as it is 
written. Then try to find a simpler circuit to implement the expression. 

9. (xy')'z. 10. xy' + (x+z)'. 

11 . xy' + (xy)', 12 . xy'z + x'yz. 

13. Find a Boolean expression corresponding to the circuit 



14. Find a Boolean expression corresponding to the circuit 



15. Find a Boolean expression corresponding to the circuit 
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16. Find a Boolean expression corresponding to the circuit 



17. Find a Boolean expression corresponding to the following circuit. Simplify the 
expression, and find a simpler circuit that achieves the same result. 
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Relations and Functions 


Functions are important in many areas of mathematics. Elementary algebra starts to 
differ from arithmetic when the concept of a function is developed. Calculus can 
be viewed as being the study of functions, and of certain ways of associating new 
functions with a given one. 

We start by discussing binary relations. These can be seen as a generalization of 
functions, and we lead into functions by looking at special relations. 

The reader who has already started to study calculus will find that we focus on 
slightly different aspects of functions. Our viewpoint may be less algebraic than what 
you have seen before. 


4.1 Relations 

First Definitions 

A ( binary ) relation p from a set S to a set T is a rule that stipulates, given any 
element s of S and any element t of T, whether s bears a certain relationship to t 
(written s pt) or not (written s X> t). For example, if S is the set of living males and 
T is the set of living females, the relation p might be “is the son of;” if s denotes a 
certain man and t denotes a certain woman, we write s p t if s is the son of t, and 
s)pt otherwise. 

Binary relations are very common in mathematics. We have already seen rela¬ 
tions such as <, = and <, and all of these are relations from M to M (or from Z to Z, 
or ...). The relation |, or “divides,” is a relation from Z + to Z + , or from Z + to Z. If 
V and Q are any collections of sets, then c and D are relations from V to Q. 

Alternatively, we can define a binary relation p from the set S to the set T as a set 
p of ordered pairs (s, t), where s belongs to S and t belongs to T, with the notation 
that s pt when (s, t) e p and s)pt otherwise. This means that, formally, a binary 
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relation from S to T can be defined as a subset of S x T, the Cartesian product of S 
and T defined in Section 2.4. 

Sample Problem 4.1. Suppose S = {1, 2, 3} and T = {1, 2, 3, 4}. Find the sets 
corresponding to the binary relations s < t and s 2 = t, in the usual arithmetical 
sense. 

Solution. The relation set corresponds to the set L, defined by 
L = {(1, 2), (1, 3), (1,4), (2, 3), (2,4), (3,4)}. 


s 2 = t has 

R = {(1,1), (2,4)}, 

as its corresponding set. 

Practice Exercise. What are the sets corresponding to s < t and s >tl 


New Relations from Old 

Sometimes it is useful to be able to combine relations. For example, consider the 
relations “is the father of” and “is the mother of.” The relation “is a parent of” could 
be interpreted as meaning that one or the other of these two relations holds. If spt 
means “s is the father of t” and sat means “5 is the mother of t” then s(p or a)t 
would mean “5 is a parent of t.” 

This sort of combining can be applied to binary relations in general. If we have 
two binary relations p and a , each from S to T, we can define two further binary 
relations, their union p U a and their intersection p Da, from S to T in the following 
way: 


s (pU a)t if and only if s p t or s a t or both; 

s(pHa)t if and only if both s p t and s a t. 

In other words, (s,t) e p U a if and only if (s, t) e p or ( 5 , t) e a or both, and 
(s,t) e p H a if and only if (s, t) e p and (s,t) e a , so that the notation agrees 
exactly with the usual set-theoretic notation. 

We can also construct new relations from old ones by composition. If r is a binary 
relation from R to S , and a a binary relation from S to T, then the composition r a 
is a binary relation from R to T, where r(za)t if and only if there exists at least one 
element s e S such that rxs and sat. 

If r is a binary relation from R to S, and v a binary relation from P to T, then 
the composition tv is not defined unless P = S. 
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Sample Problem 4.2. Suppose R = {1,2}, S = {3, 4, 5}, and T = {6, 7}. The 
binary relations t from R to S, p from S to T, and o from S to T are defined by 

X = {(1,3), (2, 4), (2, 5)}, 

P = {0, 6), (3, 7), (5, 7)}, 
a = {(3,6), (4, 6), (4,7)}. 

What are the binary relations t p, po and p C la? 

Solution. The relations are as follows: 

xp = {(1,6), (1,7)}, 

pa is not defined as T is not equal to S, 

pDa = {(3,6)}. 

Practice Exercise. What are the binary relations pz,za and p U a? 

Theorem 21. Composition of binary relations is associative: if a, /3, and y are given 
binary relations and if either of the compositions a(/3y) and ( a/3)y exists, then so 
does the other, anda(fy) = ( a/3)y. 

Proof. Let Q , R, S , T be sets where a a c Q x R, /3 R x S, and y c S x T. 
Suppose there are some elements q e Q and t e T that satisfy q (a/3) yt. Then 
by the definition of composition there exists an s e S such that qaf> s and sy t. 
Again by the definition, there exists r e R such that qar and r /3 s. But now since 
r /3 s and sy t, it follows that r f3y t, and since qar , we have q a(/3y) t. Hence 
(af3)y c a(/3y). A similar argument shows that a(/3y) c (a/3)y and therefore the 
two compositions are equal. □ 

Every binary relation has an inverse relation : if or is a binary relation from S to 
T , so that a c S x T, then a~ l is a binary relation from T to S , or a~ l c T x S, 
defined by 

a -1 = {(t, s) : (s, t) e a}. 

In other words, t a~ l s if and only if sat. 

Sample Problem 4.3. What is the inverse relation ofz, in the preceding Sample 
Problem? 

Solution, r- 1 = {(3, 1), (4, 2), (5, 2)}. 

Practice Exercise. What is the inverse relation of p in the preceding Sample 
Problem? 


The inverse of a composition may be expressed as a composition of inverses. 
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Theorem 22. If a and ft are binary relations from R to S and from S to T, respec¬ 
tively, then (a/3)~ l = 

Proof. aft is a binary relation from R to T, and hence ( af)~ l is a binary relation 
from T to R. 

Suppose (t , r) e (aft)~ l . Then (r, t) e afi. Hence for some s e S, we have 
(r, s) e a and (5, t) e ft, so (t, 5) e and (5, r) e a -1 . So (t, r) e and 

therefore, ( aft) _1 C 

Reversing the argument at each step shows that p~ x a~ l c (a/3) -1 , so the two 
relations are equal. □ 


Relations on a Set 

Suppose a is a binary relation from a set A to A itself or, in other words, a c A x A. 
Then we say that a is a binary relation < 9/2 A. One special relation on a set is the 
identity relation: 1 a is the identity relation on A if a la b is true precisely when b = a, 
that is, la = {( 0 , a) \ a e A}. 

There are several important properties that a relation on a set may (or may not) 
have. Suppose a is a relation on A. 

(i) a is called reflexive if and only if a a a for every a e A, or, as we would 
usually say, aaa is always true, a is irreflexive if and only if a a a is never true for 
any element a of A. 

(ii) a is called symmetric if and only if a a b implies baa or, in other words, 
if (a, b) E a implies (b, a) e a. Since, by the definition of inverse relation, we 
know that a a b if and only if b a~ l a, we can say that a is symmetric if and only 
if a = a~ l . a is called antisymmetric if and only if a a b and baa together imply 
a = b or, in other words, a D a~ l c la- If cl a b and baa can never both be true, 
then a is called asymmetric , or sometimes skew. (If aab and baa are never both 
true, then (aab Aba a —>► aaa is vacuously true, so all asymmetric relations are 
antisymmetric, but this gives us no information.) 

(iii) a is called transitive if and only if aab and bac together imply a a c, and 
atransitive if and only if, whenever aab and bac , a ty c, or in other words, if and 
only if a Pi aa = 0. 

Sample Problem 4.4. What are the reflexive, symmetric, and transitive pro¬ 
perties of < and < on the set Z? 

Solution. < is irreflexive (a < a is never true) while < is reflexive (a < a 

is always true). Both relations are antisymmetric, but < is, in fact, asymmetric. 

Both are transitive. 
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Practice Exercise. What are the reflexive, symmetric, and transitive properties 
of = and > on Z? 

The relation of adjacency on the set of integers, defined by i a j if and only if 
\i — j\ = 1 for i and j positive integers, is irreflexive (since \i — i\ = 0), symmetric 
(since | i — j | = | j — i |), and not transitive (since 1 a 2 and 2 a 3 but 1 ty 3). It is in 
fact atransitive for i a j and j ak together imply i = j zb 1 and k = j d= 1, so i = k 
or i = k zb 2, but i cannot equal c zb 1. So a ty c. 

The properties of relations can be exhibited even in quite small examples. For 
example, say A = {1, 2, 3} and consider the relation p on A, defined by 


= {( 1 , 1 ), ( 2 , 2 )}. 

Since la 2 >0, it is not reflexive, but since la H P / 0, it is not irreflexive either. 
Since P = /3~ l , it is symmetric, and since ft D c cA, it is also antisymmetric. 
Since /3/3 = it is transitive. 

If a and are relations on A and c a, then we say a covers Some of the 
properties we have defined can be described in terms of covering. For example, if 
a is a relation on A, then a is reflexive if and only if it covers la . By the definition 
of composition we know that aab and bac together imply that a aa c, so a is 
transitive if and only if it covers aa. 

Sample Problem 4.5. Let A = {1, 2, 3} and define the relation y on A by 
y = {(1,1), (2, 2), (3, 3), (1,2), (2, 3)}. 

What are the reflexive, symmetric, and transitive properties of y ? 

Solution. Since la 2 y, Y covers la , so it is reflexive; since y~ l = {(1, 1), 
(2, 2), (3, 3), (2, 1), (3, 2)} and y~ l D y = la, it is antisymmetric; since 1 y 2 
and 2 y 3 but 1^3, it is not transitive, but since y c yy, it is not atransitive 
either. 

Practice Exercise. Let A = Z U {a/ 2} and define the relation 8 on A by a 8 b if 
and only if a + b e Z. Show that 5 = Z x Z. What are the reflexive, symmetric, 
and transitive properties of 5? 


Exercises 4.1 

In Exercises 1 to 6, a binary relation from S = {1, 2, 3, 4, 5, 6, 7, 8, 9} to itself is 
defined. For each of these relations, what is the corresponding subset of S x S? For 
each relation, what is the inverse relation? 

1. xay means v = y 2 . 

2 . x f y means x + y = 9. 
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3 . x y y means x divides y. 

4. x by means 2x < y. 

5. x e y means x 2 + y = 12. 

6. x 0 y means x — v = 0. 

7. Relations a and f> are binary relations from S = {1, 2, 3} to T = {1, 2, 3, 4} 
defined by 

xa y means x = y 2 , 
x /3 y means x = y — 1. 

For each of these relations, what is the corresponding subset of S x T1 

8. Suppose p and o are relations defined on S = {1, 2, 3, 4} by the sets 

p = {(2, 2), (3, 2), (3, 4), (4, 1), (4, 3)}, 
or = {(2, 3), (1,2), (3,1), (3, 2), (4, 4)}. 

Find the set form of the composition per. 

9. Given p and a as in Exercise 8, what is the set form of the relation crp? 

10. Given p and a as in Exercise 8, what are the set forms of the relations pp and 
crcrl 

11. Given p as in Exercise 8, what is the set form of the relation ppp? 

12. Let [i and v be two binary relations from the set Z of integers to itself, defined 
by 

/U. = {(m, 2m) | m e Z}, 

and 

v = {( n , 3 n) | fl g Z}. 

What is the composite relation jivl 

13. Let a and be any two given binary relations from A to B such that a c or, 
in other words, such that a a b implies a /3 b. 

(i) Show that if y is a binary relation from B to C, then ay c /3y. 

(ii) Show that if 8 is a binary relation from D to A, then 8a c 8/3. 

In Exercises 14 to 20, prove that the indicated identity holds for any binary relations 
a and /3 from A to B and y from C to A. 

14. (a -1 ) -1 = a. 

15. a~ l = if and only if a = f3. 

16. a~ l C /3~ l if and only if a c j3. 

17. (a D /3) -1 = a~ l H /3~ l . 

18. (aU/3)- 1 =a- 1 U/3~ l . 
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19 . y(a U fi) = ya U y/3. 

20 . y(a n/3) <Oy a n y/3. 

21 . If we try to prove equality in Exercise 20, where does the proof break down? 
(Hint: consider the sets A = {2, 3}, B = {4}, C = {1}, and the relations y = 
{(!> 2), (1, 3)}, a = {(2, 4)}, f$ = {(3, 4)}.) 

22 . The relation p is defined on {a, b, c, d) by 

p = {(a, a), ( b,b ), (b,c), (c, c), (c, d), (d,c)}. 

Prove that p has none of the properties: reflexive, transitive, symmetric, antisym¬ 
metric. 

23 . If a is a relation from A to B, then we can treat a b as defining a new binary 
relation ty, called the complement of a , from A to B. In terms of sets, = 
(A x B)\a. Show that ty -1 is the complement of a -1 . 

24 . Consider the relations a and of Exercise 7. What are the sets and $ ? 

25 . Define the relations a, /3, and y on the set {1, 2, 3} by the sets 

Of = {(1,1), (2, 2), (3, 3)}, 

P = {(1, 3), (2, 2), (3,1)}, 
y = {(2, 3), (1,2), (1,3), (3, 2)}. 

Which of these relations are 

(i) reflexive, (iv) antisymmetric, 

(ii) irreflexive, (v) asymmetric, 

(iii) symmetric, (vi) transitive? 

Exercises 26 to 33 show relations from the set Z + of positive integers to itself De¬ 
termine whether each relation has each of the following properties: reflexivity, sym¬ 
metry, antisymmetry, transitivity. 

26 . m is divisible by n. 27 . m + n > 50. 

28 . m + n is even. 29 . m + n is odd. 

30 . mn is even. 31 . mn is odd. 

32 . 31 (m + n). 

33 . m equals some positive integer power of n. 

34 . A group of people, denoted by A, B, , are at a party. Consider the following 
relations: is each reflexive, symmetric, antisymmetric, asymmetric, or transitive? 

(i) A a B means A and B spoke to each other at the party. 

(ii) A f B means A spoke to B at the party. 
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(iii) Ay B means A and B come from the same city. 

(iv) A 8 B means A and B met for the first time at the party. 

(v) A € B means A is the father of B. 

35. Find the error in the following “proof.” 

Theorem. A symmetric and transitive relation must be reflexive. 

Proof. Let a be a symmetric and transitive relation on a set A. If a a b, then 
baa because a is symmetric. But aab and baa together imply aa a since a 
is transitive. Therefore a is reflexive. 

36. Show that if a is reflexive (respectively, irreflexive, symmetric, antisymmetric, 
transitive, atransitive), then so is a -1 . 

37. Let a and /3 be relations on a set A. Show that 

(i) if a and /3 are reflexive, so is a/3, 

(ii) if a and /3 are symmetric, then a/3 is symmetric if and only if aft = Pa, 

(iii) if a and /3 are antisymmetric, a/3 is not in general antisymmetric, 

(iv) if a and /3 are transitive, a/3 is transitive if a/3 = Pa. 


4.2 Some Special Kinds of Relations 

Equivalence Relations 

Let a be a relation on a set A. Then a is said to be an equivalence relation on A if 
and only if it is reflexive, symmetric, and transitive. 

The obvious equivalence relation is equality, on any set. In sets other than num¬ 
ber sets, equal objects are often called “equal in all respects.” More generally, an 
equivalence relation can be considered as a statement that two objects are “equal in 
some (specified) respects.” One example, on the integers, is the relation a, where 
aab is true if and only if a = =b b — that is, a and b have the same absolute value. 
Another, which you have probably seen in high school geometry, is congruence, on 
the set of all plane triangles. 

The word “congruence” is also used for another important equivalence relation. 

Sample Problem 4.6. The relation “congruence modulo n ” for a positive inte¬ 
ger n is defined on the set Z of integers by specifying that a is congruent to b 
modulo n, written “a = b (mod^),” if and only if n \ {a — b). Show that con¬ 
gruence modulo n is an equivalence relation. 

Solution. Since n \ (a—a), the relation is reflexive; if n | ( a — b ) then n | (, b—a ), 
so the relation is symmetric; if n | (a—b) and n | ( b—c ) then we have a—b = kn, 
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and b — c = In, for some k,l e Z, hence a — c = (k + l)n and n \ (a — c), which 
shows that the relation is transitive. So congruence modulo n is an equivalence 
relation on the integers. 

Practice Exercise. Let S be the set of all geometrical solids. Write a a bio mean 
that a and b have the same volume. Show that a is an equivalence relation on S. 

The relation of congruence modulo a positive integer will be discussed at length 
in Section 9.2. 

Partitions of a set were defined in Section 2.2. There is a very important con¬ 
nection between equivalence relations and partitions. Suppose S is a set and a is an 
equivalence relation on S. Given s e S, we define 

[s] = {t : t 6 S, t a s}. 

Then [s] is called the a-class of s or the equivalence class of s (under a). By the 
reflexive law, [s] is never empty: it always contains s. 

If two or more equivalence relations are being discussed, we write [s] a to denote 
the equivalence class under the relation a. 

Sample Problem 4.7. What are the equivalence classes of congruence mod 5 
on Z? 

Solution, a = b means 5 | (a — b), so two numbers are in the same equivalence 
class if and only if they leave the same remainder on division by 5. So 

[a] = {..., a — 10, a — 5, a, a + 5, a + 10, ...}, 

which is sometimes abbreviated to a + 5Z. There are five classes, namely [0], 
[1], [2], [3], [4]. 

Practice Exercise. What are the equivalence classes of the relation a = ±b 
on Z? 

Theorem 23. Suppose a is an equivalence relation on S. Then 

(i ) s at if and only if [s] = [t], 

(ii) t e [s] if and only if [ t] = [s], 

(iii) given s,t e S, either [s] D [t] = 0 or [s] = [t], 

(iv) the set of disjoint equivalence classes partitions the set S. 

Proof, (i) Suppose sat. Let v e [s]. By the definition of equivalence class x a s, 
and hence by transitivity x at. Again by the definition of equivalence class, v e [t]. 
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Hence [ s] c [t]. But by symmetry, sat implies t a s, so a repetition of the above 
argument shows that [ t] c [s]. Hence [s] = [t]. 

Conversely, let [.s'] = [t]. By reflexivity s e [s] so s e [t ] and sat. 

(ii) Let t e [s]. This means that t c s and by (i) [t] = [s]. Conversely, if 
[t ] = [s], then by (i) t c s and we have t e [s 1 ]. 

(iii) Given s,t e S, either [s] Pi [t] = 0 or [s] D [t] / 0. In the former case, the 
theorem is satisfied; in the latter case, there must exist an element x e S such that 
x e [*] n [*]. By (iii), since v e [5] we have [x] = [s]. Similarly [x] = [t]. Hence 

[s] = M- 

(iv) For every s e S, we have s e [s 1 ]. Hence the union of all the equivalence 

classes contains the whole of S. But by (iii), distinct equivalence classes are disjoint, 
which proves the theorem. □ 

Theorem 23 shows that every equivalence relation on a set induces a partition of 
that set. Next we show that the converse of this theorem is also true. 

Theorem 24. Consider a partition of the set S given by 

S = 5iU---U5 n . 

Define a relation a on the set S as follows: sat if and only if s and t both belong to 
the same set Si of the partition. Then a is an equivalence relation on S. 

Proof. We must verify the three properties of an equivalence relation. First, since any 
element s belongs to exactly one set of the partition, we have s a s and the relation 
is reflexive. Next, if s at, then s and t belong to the same set of the partition, so that 
we also have t as and the relation is symmetric. Finally, if r as and sat, then r, s, 
and t must all belong to the same set of the partition, so that rat and the relation is 
transitive. □ 


Order Relations 

A relation a on a set A is called a partial order relation on A if and only if it is 
antisymmetric and transitive. If a is also reflexive, it is said to be a weak partial 
order; if it is irreflexive, it is called a strong partial order. 

We say two elements of a set A are comparable under a if either aab or baa is 
true. A partial order relation for which every pair of elements is comparable is called 
a total order. 

Familiar examples of order relations include the relations < and < on the set 
Z of integers (or on other sets of real numbers). Both relations are transitive; < is 
asymmetric and < is antisymmetric. So < is a strong partial order and < is a weak 
partial order. If a and b are real numbers and a ^ b, then either a < b or b < a, and 
similarly a < b or b < a, so both are total orders. 
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Another important example is the relation of containment, denoted by c, on the 
power set P(S) of a set S'. If A c B and KA, then A = B, so c is antisymmetric; 
if A c B and B c C, then A c C, so c is transitive. So containment is a partial 
order. Since A c A, containment is a weak partial order. If S contains at least two 
elements, say s 1 and ^ 2 , then {s\} £ {^ 2 } and {^ 2 } 2 {^ 1 }, so the order is not total. 
(However, if S is a one-element set, S = {s}, so that V(S) = {0, S}, then the order 
is total.) 

Sample Problem 4.8. Consider the relation \ on the set Z + of positive integers: 
a | b means that for some n e Z + , b = na. Show that \ is a weak partial order, 
and is not total. 

Solution. If a \ b and b \ a, then a = b, so | is antisymmetric. If a \ b and b | c, 
say b = n\a,c = n 2 b for some positive integers n\ and n 2 , then b = n\a,c = 
n\n 2 b and a | c, so | is transitive and is a partial order relation. Since a \ a, it is 
weak. Since 2 does not divide 3 and 3 does not divide 2, the integers 2 and 3 are 
not comparable, it is not a total order. 

Practice Exercise. Write alb to mean “a \ b but a £ bC Show that ? is a partial 
order on Z + . Is it weak or strong? Is it total? 

Suppose a is a relation on A and suppose a\ a < 22 , <22 01 < 23 , ..., ak -1 a ak and 
ak a a\ are all true, where the elements a\, < 22 , < 23 ,..., ak are all different. Then we 
say a contains a cycle of length k. 

Theorem 25. No order relation contains a cycle of length greater than 1. 

Proof. Let a be an order relation on the set A. We may have a a a for some a e A, 
and this is a cycle of length 1. There cannot be a cycle of length 2, for that would 
mean that aab and b a a for some a ^ b, contradicting the antisymmetry of a. 

Consider the set L of all lengths, other than 1, of cycles in a. Suppose L is not 
empty. Then we are led to a contradiction as follows. There must be a least element 
k g L, and because 2 ^ L, we know k > 3. So for some elements a\, < 22 ,..., ak 
of A, aia ci 2 , ci 2 0 t as, ..., ak -1 a ak, and aka a\, where k > 3 and the only shorter 
cycles have length 1. But a is transitive, so a ak -1 a and ak a a\ together imply 
ak~\ a a \. So a contains a cycle of length k— 1, and 2 < k — 1 < k. This contradicts 
the choice of k. So L = 0, and no cycles of length 2 or longer exist. □ 


Predecessor-Successor Relations 

Relations that are irreflexive, asymmetric, and atransitive are called predecessor- 
successor relations. An example is the relation “is the daughter of” on the set of 
all women. No one is her own daughter, so the relation is irreflexive. If Amy is the 
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daughter of Belle, then Belle can never be the daughter of Amy, so the relation is 
asymmetric. If Belle is the daughter of Claire, and Amy is the daughter of Belle, 
then Amy is not the daughter of Claire, so the relation is atransitive. 

Exercises 4.2 

In Exercises 1 to 4, is the relation a an equivalence relation on the set S? 

1. S = IZ, a a b means ab > 0. 

2. S = IZ, a a b means ab > 0. 

3. S is the set of all nonzero real numbers, a a b means ab > 0. 

4. S = Z+, a a b means a and b have greatest common divisor 16. 

In Exercises 5 to 7, is the relation ft a partial order on the set S? For each partial 
order state whether it is weak or strong, total, or not total. 

5. S = Z; a fib means a \ b. 

6. S = Z; a fib means a i b (as defined in Sample Problem 4.8). 

7. S is the English alphabet; a ft b means a comes before b is the usual alphabetical 

order. (Do not be confused; a and b do not necessarily mean the letters “a” and 

“b” here.) 

8. Which, if any, of the relations in Exercises 4.1.26 to 4.1.34 are equivalence rela¬ 
tions? 

9. Suppose A is the set Z x Z. Prove that the relation a on A, defined by 

(x,y)a(z,t) if and only if x + t = y + z, 
is an equivalence relation. 

10. Prove that the relation ft on Z° x Z°, defined by 

(x,y)fi(z,t) if and only if xt = yz, 

is an equivalence relation. Would this remain true if Z° were replaced by Z? 

11. A relation p on a set S is said to be circular if and only if for every x,y,z £ S, 
x py and y p z together imply that zpx. Show that a reflexive circular relation 
is an equivalence relation. Give an example of a circular relation that is not an 
equivalence relation. 

12. Suppose p is a reflexive and transitive relation on a set S. Define a new relation 
a as follows: a a b is true if and only if both a pb and a p a are true. Show that 
cr is an equivalence relation. 

13. Consider the set Z of integers and its power set P( Z). Define a relation p on 
P( Z) by 

A p B if and only if A + B is finite, 
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where A, B c Z and + is the symmetric difference of sets (defined in Sec¬ 
tion 2.4). Show that p is an equivalence relation. 

14. Let A be a finite set. Define a relation p on P (A) by Bi p Bj if and only if 
there exists a function f)j : Bi —> Bj such that / is one-one and onto, where 
Bi , Bj c A. Show that p is an equivalence relation on P(A). 

15. Let a and P be equivalence relations on a set A. 

(i) Show that a/3 is an equivalence relation if and only if ap = pa. 

(ii) Show that p = a Pi P is an equivalence relation. 

(iii) Is a U P necessarily an equivalence relation? 

(iv) Let a and P be congruence modulo m and n , respectively, on Z. Let p = 
a D p. Find the smallest positive integer equivalent to 0 under p. 

16. Let a and P be equivalence relations on A and let p = a D p. By Exercise 15(ii), 
p is an equivalence relation. Describe the partition induced by p in terms of 
those induced by a and p. 

17. (i) Consider the relation “congruence modulo 7” (written a = b (mod 7)) de¬ 

fined on Z. Into how many classes does this relation partition Z? 

(ii) Now define a relation p on Z by a p b if and only if a 2 = b 2 (mod 7). Show 
that p is an equivalence relation. Into how many classes does p partition Z? 

18. Let 77i and TJ 2 be two partitions of a given set U. We say Tl\ is finer than 772, 
and write Tl\ < 772, if and only if X e 77 1 implies ICI for some Y € 772. 

(i) Show that ^ is a weak partial ordering on the set of partitions of U for any 
set U. 

(ii) Show that -< cannot be a total ordering when \ U\ > 3. What is the situation 
when | U \ =21 

19. Suppose p is a partial order on a set A. Show that its inverse relation p~ l is also 
a partial order on A. 

20. Show that the relation <, where a<b means b = a + 1, is a predecessor-successor 
relation on Z°. 


4.3 Functions 


Functions arise in all parts of mathematics. The most familiar ones are simple alge¬ 
braic functions such as linear and quadratic functions, exponentials, and logarithms. 
Other number-based functions arise in trigonometry and calculus. We need to discuss 
functions in a more general way. 
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Definition of a Function 

A function, or mapping , / from a set S to a set T, is a rule that associates with each 
member of S exactly one member of T. In other words, a function is a special kind 
of relation from S to T, one in which each element of S occurs as the first element 
of precisely one ordered pair. This uniqueness property is the main point of the idea 
of a function. 

The set S is called the domain of the function, and the set T its codomain. If we 
used notation consistent with that of the last section, we would write / c S xT, but 
more often when dealing with functions we write / : S T\ the set form is then 
called the defining set or associated set of /. To indicate the correspondence between 
elements, we could again write ( s,t ) e /, but more often we write f(s) = t, as, for 
example, in familiar algebraic functions like /(x) = x 3 — 3x + 1. This is referred 
to as left notation. In algebraic and set-theoretic contexts, some authors use right 
notation and would write xf instead of /(x); you may find right notation in some 
books. 

The image of x under / is the value fix); if R is any subset of S, then f(R) = 
{fir) : r e R} is the set of images of elements of R and is called the image of R. In 
particular f(S), the image of S, is called the range of / and necessarily f(S) C T. 
The range and domain of / will be denoted 7 Z(f) and £>(/), respectively. To avoid 
trivial cases, it is usual to require that £>(/) should not be empty. 

The composition of two functions is defined just as the composition of relations 
is. In functions, it can be more simply described by saying “first perform one func¬ 
tion, then the other” (where performing f means replacing the value x by fix)). 
The composition “first /, then g” is written gifix)) in left notation, and this can be 
interpreted as defining a new function gif), by the natural rule 

£(/)(*) = •?(/(*)). 

In order for g(f)(x) to be defined, it is necessary that every value of fix) be in the 
domain of g. This will not necessarily be true for all x in P(x); in fact, the domain 
Of gif) is 

T>(gif)) = {x : x e 1Xf), fix) e V{g)}. 

This method of combining functions is not commutative; quite possibly gif) 
and fig) will be different functions. In fact, one might be defined and the other not. 
However, composition is associative because it is just a special case of composition 
of relations. 

In right notation, we would write xfg instead of g(/)(x); in either notation, the 
function to be applied first is written closest to the variable. 

Sample Problem 4.9. S is the set of all real numbers other than 0 and 1. Con¬ 
sider the following functions from S to S: 
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/i(x) = x; f 2 (x) = / 3 (x) = 1 - x; 

1 X x x - 1 

/ 4 (x) = --; / 5 (x) = —; /6(x) = --. 

1 — X x — 1 X 

Find formulas for / 2 (/ 2 ), hiff), and fsify), aftd verify that they are all mem¬ 
bers of the original set of six functions. Use these to verify that composition is 
not commutative. 

Solution. 

h(h)(x) = j = -U = x = /i(x); 

Mh)(x) = hi— 7) = 1-—r = 7^— = /4O); 

\x - 1/ x - 1 1 -x 

/s(/3)W = /sd - x) = - — = — = / 6 (x). 

1 — x — 1 X 

So / 2 (/ 2 ) = fu fzif 5 ) = /4 and / 5 (/ 3 ) = /6- 

Observe that fs(ff) / / 3 C/ 5 ), an example of noncommutativity. 

Practice Exercise. Find formulas for /i (/i), / 2 (/ 4 ), and fsiff), and verify that 
they are all members of the original set of six functions. 


One-One and onto Functions 

Two properties of functions are particularly important: we say that a function / is 
one-to-one if f(s\) = f(sf) always implies si = s 2 for any s\,S 2 e X>(/). In other 
words, / always maps different elements to different elements. We say that / is onto 
if /(S') = T or, in other words, if for every t e T, there exists an s e S such that 

m = t. 

These properties depend on the domain of /. For example, consider the func¬ 
tion / defined by /(x) = x 2 or / = {(x, x 2 ) : x e M}, which associates its square 
with each real number. This satisfies the definition of function, for if we choose x, 
then x 2 is uniquely determined. 

(i) If we define the domain of / to be the set M of all real numbers, and the 
codomain to be M also, then / is neither one-to-one (since fix) f = /(—x)) 
nor onto (since /(M) = {y e M : y > 0}). 

(ii) If instead we define the domain of / to be the set of all nonnegative real num¬ 
bers, but let the codomain still be the whole of M, then / is one-to-one but is 
not onto. 

(iii) If we let the domain of / be R but restrict the codomain to be the set of all 
nonnegative real numbers, then / is onto, but is not one-to-one. 

(iv) If both the domain and the codomain are defined to be the set of nonnegative 
reals, then / is both one-to-one and onto. 
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Inverse Functions 

Given any set S we can define the identity function is : S —>► S by is (s) = s for 
every s e S; in other words, 


is = {(s,s) : S e 5}. 


For a function, as for any relation, we can define an inverse relation. If the func¬ 
tion is, say f(x) = x 2 , and if we let the domain and the codomain equal M, then 
the inverse relation f~ l contains the pairs (4, 2) and (4, —2), so it is not a function: 
f~ l (x) would not be uniquely defined. This raises an important question: when is 
the inverse relation of a function also a function? The following theorem gives the 
necessary and sufficient conditions. 

Notice that the function f\ in Sample Problem 4.9 is the identity function on the 
set S , and in that problem we showed that /2 is its own inverse. 

Sample Problem 4.10. What is the inverse of the function f 4 from Sample Prob¬ 
lem 4.9? 

Solution. fa(x) = 1/(1 — x). Suppose y = 1/(1 — x). Then l — x = 1/y, so 

x = 1 - 1 /y = (y- 1 )/y. Therefore fp(y) = (y - Y)/y. So fp = f 6 . 

Practice Exercise. What is the inverse of the function fo from Sample Prob¬ 
lem 4.9? 

Theorem 26. Let f be a function, f : S —> T. Then f is one-to-one and onto if and 
only if its inverse relation f~ l is a function from T to S. 

Proof, (i) Suppose / is one-to-one and onto. Since / is one-to-one each element of 
T occurs as the second element of at most one ordered pair in /; since / is onto, 
each element of T occurs as the second element of at least one ordered pair in /. 
Hence if t e T, exactly one ordered pair (s,t) e /. 

Now consider the inverse relation: (t, s) e / -1 exactly when (s, t) e f. Hence 
for every t e T, exactly one ordered pair (t,s) e / -1 , so f~ l is a function from T 
to S. 

(ii) Suppose conversely that / _1 is a function from T to S. Then for every t e T, 
exactly one ordered pair (t, s) e f~ l and hence exactly one ordered pair (s,t) e /. 
Since at least one ordered pair has t as its second element for each t e T, we see that 
/ is onto; since at most one ordered pair has t as its second element for each t e T, 
we see that f is one-to-one. □ 

The following theorem gives us some information about the behavior of identity 
and inverse functions. 
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Theorem 27. Suppose f is any function from S to T . 

(i) fits) = f = 0 i T )f • 

(ii) If / is one-to-one and onto, then f~ l if) = ts and fif~ l ) = tr¬ 
im) If both / and g : T 1/ are one-to-one and onto functions, so that / -1 and 

g _1 are functions, then the composition g(/) is defined, (g(/)) _1 is a function 
and ( g(f))~ l = / _1 (g _1 ). 

Proof, (i) All three functions map T into S. For all t e T, 

{f(ts))(s) = f(ts(s)) = f(s) = tr(fis)) = (/ i T (f))(s ), 
which proves the statement. 

(ii) Since / is one-to-one and onto, f~ l is a function. Moreover, both / -1 (/) 
and ts map S to S\ both /(/ -1 ) and tj map T to T. For every s e S, exactly one 
ordered pair ( s,t ) belongs to /; the corresponding pair (t, s) e f~ l and is the only 
ordered pair in f~ l with t as its first element. Hence (s, s) e f~ l (f) for every 
s e S, and no element (^i, sf), with S 2 , belongs to / _1 (/)« So / -1 (/) = ts- 
Similarly /(/ -1 ) = it- 

(iii) Since the image of / equals the domain of g, the composition g(f) is de¬ 
fined. Suppose g(f)(s i) = g(f)(s 2 )- Since g is one-to-one, f(s\) = /fe); since / 
is one-to-one, si = S 2 - So fg is one-to-one. Now select u e U. Since g is onto, there 
exists an element t e T such that g(t) = u ; since / is onto, there exists an element 
s e S such that f(s) = t.So g(f)(s) = git) — u and gif) is onto. Since fg is 
one-to-one and onto, ifg)~ l is defined. 

Now consider u e U and let s be the unique element of S such that gif)is) = u 
and, equivalently, igif)~ l )iu) = s. Then 

r\ g - x )(u) = r\g~\u)) 

= where g{t) = u 

= s where f(s ) = t, 

so (g(/)) -1 and have exactly the same action on it. But since it was an 

arbitrary element of U, this means that = f~ l (g~ 1 )- □ 

Exercises 4.3 

lit Exercises 1 to 8, S = {1,2, 3, 4}. Is the given set a function from S to S? 

1. {(1,4), (2, 3), (3, 2), (2, 1), (4,4)}. 

2. {(1,1), (3,1), (4, 3)}. 

3. {(2,1), (3, 4), (1,4), (2, 1), (4, 2)}. 

4. {(1,4), (2, 4), (3, 3), (4, 2)}. 
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5. {(1,4), (2,1), (3, 4), (4,1), (3,4)}. 

6. {(1,1), (3,1), (1,2), (4, 2)}. 

7. {(1,2), (2, 2), (3,1), (4, 2)}. 

8. {(2, 2), (1,2), (2,1), (3, 4), (4, 2)}. 

9. Prove that ordinary addition of integers is a function + : (Z x Z) —> Z. What is 
the associated set? 

10. Define A = {a, b, c, d}. Does the following set define a function? 

{(a, (b, c)), (a, (c, b ))), (b, (c, c))}. 


11. Prove that {(x 2 ,x + 1) : x e M) is not the defining set of a function. Does 
{{x 2 ,x) : x e K*} represent a function, where K* is the set of the nonnegative 
real numbers? 


In Exercises 12 to 14, is the function one-to-one? Is it onto? 

12. / : K K, where AT = {1,2,...,*:} and /(./) = j + 1, for j = 1,2,..., 
*-l,/(*) = l. 

13. / : Z + —>■ {0, 1, 2, 3), where Z + is the set of positive integers, and 


/O') = 


0 if 7 j y,3 |y, 

1 if 7 | ./, 3 \ j, 

2 if 21 | j, 

3 otherwise. 


14. / : Q ^ Q, where f(q) = q 3 for q e Q. (Q denotes the set of rational 
numbers.) 

In Exercises 15 to 18 ,find the inverse of the given function. 

15. /6 : S S from Sample Problem 4.9, where S is the set of all real numbers 
other than 0 and 1. 


16. /5 : 5 —> 5 from Sample Problem 4.9 (same S as the preceding question). 


17. / : M —>► M + , where M is the set of real numbers, M + is the set of positive real 
numbers and 


fix) = 


2 — x if x < 1, 
l/x if x > 1. 


18. g : 


l, where M is the set of real numbers and 


g(x) = 


2 x if v < 0, 
3x if x > 0. 


19. Let / : A B be a given function. Suppose A'cA and B' B. Show that 
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(i) A'c(/-i(/))(A'),and 

(ii) c B', 

where = [a e A : f(a) e B'}. 

(Note: / -1 is the inverse binary relation of / considered as a binary relation ; 
f~ l need not be a function.) 

20. Let / : A -> B be a function. Show that it can be written as a composition 
/ = h(g), where g : A —>► C is an onto function and h : C —> 5 is a one-to-one 
function, for some appropriate set C. 

21. Let /, g,h be functions such that / is one-to-one, g(/) and h(f) are defined, 
and g(/) = h(f). Show that g = h. 

22. Let / : S ->► S' be one-to-one onto, and let g : S ->► S'. Show that 

(i) g is one-to-one if and only if f(g ) is one-to-one, 

(ii) g is onto if and only if f(g) is onto. 
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The Theory of Counting 


Enumeration answers the question “how many.” In a practical case, we find the num¬ 
ber of elements in a set by counting. So the theoretical counterpart is often called the 
theory of counting. 

A typical practical counting question is how many subsets of size 2 are there in 
the set {a , b, c}? This can be answered by listing the subsets—{< a , b}, {a, c}, and 
{b, c }—and then counting that there are three of them. A more general question 
is how many subsets of size 2 are there in a three-element set? A little thought 
convinces us that the number is the same for any three-element set, so the answer 
is again 3. But consider the question how many subsets of size 2 are there in an n- 
element set? This is truly theoretical, and to achieve the answer ^ n(n — 1) is a piece 
of enumeration. 

In this chapter we introduce the idea of an “event,” which enables us to clarify 
the sets we are counting and lays the foundation for the application of enumeration in 
studying probability later. We then study the basic tools for theoretical counting, in¬ 
cluding one-to-one correspondence, selections, arrangements, and the binomial the¬ 
orem. 


5.1 Events 

Experiments and Events 

In many cases we want to know how many ways an event can occur, or in how 
many ways a task can be performed. For this it is convenient to use the language of 
experimentation. 

An experiment is defined to be any activity with well-defined, observable out¬ 
comes or results. For example, a coin flip has the outcomes “heads” and “tails.” The 
act of looking out the window to check on the weather has the possible results “it 
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is sunny,” “it is cloudy,” “it is raining,” “it is snowing,” and so on. Both of these 
activities fit our definition of an “experiment ” 

The different possible outcomes of an experiment will be called sample points of 
the experiment. The set of all possible outcomes is the sample space. Each subset of 
the sample space is called an event. So discussions of experiments and their outcomes 
are, in fact, discussions of sets. 

This terminology is used because counting results, and the corresponding results 
on probability that will be developed in the next chapter, are often used by statis¬ 
ticians in analyzing experiments and testing conclusions from them. However, we 
shall also use the same words in other situations, such as the weather observations 
listed in the preceding paragraph, or listing the possible results in games. 

Many examples will involve ordinary dice, such as are used in games like Trivial 
Pursuit. Each die has six faces with the numbers 1 through 6 on them. The sample 
space is the set S = {1, 2, 3, 4, 5, 6}. The event “an odd number is rolled” is {1, 3, 5}. 

Sample Problem 5.1. Betty and John roll a die. If the result is a 1 or 2, John 
wins; otherwise Betty wins. What is the event John wins ? 

Solution. {1,2}. 

Practice Exercise. In the above game, what is the event Betty winsl 

Sample Problem 5.2. How many events are there associated with the roll of one 
die ? 

Solution. Each event corresponds to a subset of {1, 2, 3, 4, 5, 6}. There are 2 6 
subsets, so there are 2 6 possible events. 

Practice Exercise. How many events are there associated with the flip of one 
coin? 

Sample Problem 5.3. In an experiment three identical coins are flipped simulta¬ 
neously and the results are recorded. What is the sample space ? 

Solution. {HHH, HHT , HTT, TTT}. Notice that the order is not relevant 
here, so that, for example, the events HHT , HTH , and THH of Sample Prob¬ 
lem 5.5 are all the same event in this case. 

Practice Exercise. An experiment consists of flipping a quarter and noting the 
result, then flipping two pennies and noting the number of heads. What is the 
sample space? 
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Set Language 

Since events are sets, we can use the language of set theory in describing them. We 
define the union and intersection of two events to be the events whose sets are the 
union and intersection of the sets corresponding to the two events. For example, 
in Sample Problem 5.1, the event “either John wins or the roll is odd” is the set 
{1, 2}U{1, 3, 5} = {1,2, 3, 5}. (For events, just as for sets, “or” carries the understood 
meaning, “or both.”) The complement of an event is defined to have associated with 
it the complement of the original set, so the complement of A is “A does not happen ” 
Venn diagrams can represent events, just as they can represent sets. 

The language of events is different from the language of sets in a few cases. If S 
is the sample space, then the events S and 0 are called “certain” and “impossible.” If 
U and V have empty intersection, they are disjoint sets, but we call them mutually 
exclusive events. 

Sample Problem 5.4. A die is thrown. Represent the following events in a Venn 

diagram. 

A: An odd number is thrown. 

B: A number less than 5 is thrown. 

C: A number divisible by 3 is thrown. 

Solution. 



Practice Exercise. Repeat the above example for the following events. 
A: An even number is thrown. 

B: John wins (in the game of Sample Problem 5.1). 

C: Betty wins. 


The Multiplication Principle. Tree Diagrams 

It is sometimes useful to break an experiment down into several parts, forming what 
we shall call a compound experiment. For example, suppose you are planning a trip 
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from Los Angeles to Paris, with a stopover in New York. You have two options 
for the flight to New York: a direct flight with United or an American flight that 
stops in Chicago. For the second leg, you consider the direct flight with Air France, 
a British Airways flight through London, and Lufthansa stopping in Frankfurt. There 
are two ways to make the first flight and three to make the second, for a total of six 
combinations. 

In general, suppose the event A can occur in a ways, and the event B can occur 
in b ways. Then the combination of events A and B can occur in ab ways. This very 
obvious principle is sometimes called the multiplication principle or rule of product. 
In set-theoretic terminology, the combined event is A x B. The principle only works 
when the events are performed independently—if the result of A is somehow used to 
affect the performance of B , some combined results may be impossible. 

Sample Problem 5.5. (i) A coin is flipped three times; each time the result is 

recorded. What is the sample space? How many elements does it contain? 

(ii) Repeat this example in the case where you stop flipping as soon as a head is 

obtained. 

Solution, (i) The sample space has eight elements, 

{HHH, HHT, HTH , HTT , THH , THT , TTH, TTT}. 

(ii) The sample space has only four elements 

{H, TH , TTH , TTT). 

Notice that the multiplication principle says there will be 2x2x2 = 8 outcomes. 
It applies in the first problem, but not in the second. 

The possible outcomes of a compound experiment can be shown in a diagram 
called a tree diagram. A special point (called a vertex) is drawn to represent the 
start of the experiment, and lines (branches) are drawn from it to further vertices 
representing the outcomes, which are called the first generation. The outcome is 
shown as a label, either on its vertex or on the branch leading to it. If the experiment 
has two or more stages, the second stage is drawn onto the outcome vertex of the first 
stage, and all these form the second generation , and so on. 

Sample Problem 5.6. Draw tree diagrams for the experiments in Sample Prob¬ 
lem 5.5. 


Solution. The tree diagrams are (a) and (b), respectively. 
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HJD 




T X^H<3 


T-O 


(a) 


(b) 


Practice Exercise, (i) Draw a tree diagram for the experiment described in 
Sample Problem 5.1, with outcomes “Betty wins” and “John wins ” 

(ii) Betty and John play their game twice. Draw a tree diagram for this new ex¬ 
periment. 

Even when the two parts of a compound experiment are performed simultane¬ 
ously, we can use the multiplication principle and draw a tree diagram. For example, 
if you simultaneously flip a nickel and a dime there are four possibilities: 


head on nickel, head on dime; 
head on nickel, tail on dime; 
tail on nickel, head on dime; 
tail on nickel, tail on dime. 


Diagram (a) is the tree diagram for a coin flip and diagram (b) describes the flipping 
of a nickel and a dime. 





Nickel Dime 
(b) 


(a) 


Exercises 5.1 


1. A student takes courses in mathematics and computer science. Define the fol¬ 
lowing events. 

E\ She passes mathematics. 

F: She passes computer science. 

Write symbolic expressions (using E, F, H, U,~ , \, x, + ) for the events that 
the student 
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(i) fails mathematics, 

(ii) passes both subjects, 

(iii) passes exactly one subject, 

(iv) passes at least one subject, 

(v) fails both subjects, 

(vi) passes mathematics but fails computer science. 

2. There are three roads from town X to town F, four roads from town F to town 
Z, and two roads from town X to town Z. 

(i) How many routes are there from town X to town Z with a stopover in F? 

(ii) How many routes are there in total from town X to town Z? 

(Assume that no road is traveled twice.) 

3. International airport codes consist of three letters. How many codes are possible? 
(All 26 letters in the English alphabet can be used.) 

4. There are 13 contestants. No ties are allowed. In how many different ways can 
the judges award first, second, and third prize? 

5. In a multiple-choice quiz, each question has three different possible answers. If 
there are four questions, and all questions must be answered, how many different 
answer sheets are possible? 

6. The menu in a restaurant lists three appetizers, five entrees, and three desserts. 
In how many ways can you order a three-course meal of appetizer, entree, and 
dessert? 

7. Electronic components are being inspected. Initially one is selected from a batch 
and tested. If it fails, the batch is rejected. Otherwise a second is selected and 
tested. If the second fails, the batch is rejected; otherwise a third is selected. If 
all three pass the test, the batch is accepted. 

(i) How many outcomes are there in the sample space? List them. 

(ii) Draw a tree diagram for the experiment. 

(iii) What is the event “fewer than three components are tested?” 

(iv) What is the event “the batch fails?” 

8. The ACME company uses serial numbers consisting of three letters followed by 
five digits. How many possible serial numbers are there? 

9. An experiment consists of studying families with three children. B represents 
“boy,” G represents “girl,” and, for example, BGG will represent a family where 
the oldest child is a boy and the younger children are both girls. For convenience, 
assume there are no twins (so the three children have different ages). 

(i) What is the sample space for this experiment? 
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(ii) We define the following events. 

F: The oldest child is a boy. 

F : There are exactly two boys. 

(a) What are the members of E and El 

(b) Describe in words the event F H F. 

(iii) Draw a Venn diagram for this experiment and show the events E and E on 
it. 

10. A die is rolled twice and the results are recorded as an ordered pair. 

(i) How many outcomes are there in the sample space? 

(ii) Consider the following events. 

F: The sum of the throws is 4. 

F: Both throws are even. 

G: The first throw was 3. 

H : The two throws were identical. 

(a) Write down the elements of the events E , F, G, H , FUF, G H H, 
FHH. 

(b) Write descriptions, in words, of the sets FUF, G H H, FPi/F 

(c) Are any pair of the events F, F, G, H mutually exclusive? 

(iii) Draw a tree diagram for this experiment. 

11. An experimenter tosses four coins—two quarters and two nickels—and records 
the number of heads. For example, two heads on the quarters and one on the 
nickels is recorded as 21. 

(i) Write down all members of the sample space. 

(ii) We define the events as follows. 

F: There are more heads on the quarters than on the nickels. 

F: There are exactly two heads in total. 

G: The number of heads is even. 

(a) List the members of events E,F,G,EUF,EnF,EnG,FnG. 

(b) Write descriptions in words of the events E H F, F H G. 

(c) Are any two of the events F, F, G mutually exclusive? 

(iii) Draw a tree diagram for this experiment. 

12. Your doctor tests your cholesterol level each month. She makes three tests and 
records whether the reading is higher than (H ), the same as ( S ), or lower than 
(L), the corresponding test from the preceding month. 
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(i) What are the possible outcomes of this experiment? List all members of the 
sample space. 

(ii) Consider the following events. 

E\ The cholesterol level never decreases. 

F : The cholesterol level decreases at least twice. 

(a) Write down the elements of E, F , and E . 

(b) Write a description in words of the sets E, F. 

(c) Are E and F mutually exclusive? 

(iii) Draw a tree diagram for this experiment. 

13. A bag contains two red, two yellow, and three blue balls. In an experiment, one 
ball is drawn from the bag and its color is noted, and then a second ball is drawn 
and its color noted. 

(i) Draw a tree diagram for this experiment. 

(ii) How many outcomes are there in the sample space? 

(iii) What is the event “two balls of the same color are selected?” 

14. Workers in a factory are classified by gender, experience, and union membership. 
Events are defined as follows. 

E \ The worker is male. 

F: The worker has one year’s experience at least. 

G: The worker belongs to the union. 

(i) Write expressions in symbols (using E, F, G, Pi, U, and “) for the following 
events. 

(a) The worker is a woman who belongs to the union. 

(b) The worker has less than one year’s experience and is a man. 

(c) The worker has at least one year’s experience and is a woman union 
member. 

(iii) Represent the set of all workers on a Venn diagram with the sets E, E, and 
G shown. Shade in the set defined in (ii)(c) above. 

In Exercises 15 to 22 two events are shown. Are they mutually exclusive? 

15. A die shows a 4. 

A die shows an odd number. 

16. A die shows a 4. 

A die shows a perfect square. 

17. A person is male. 

The same person is a store clerk. 
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18. A person is a college freshman. 

The same person is a college sophomore. 

19. A coin is tossed and falls heads. 

The coin falls tails. 

20. A student wears a watch. 

The student wears sneakers. 

21. Joe is a freshman. 

Joe is a sophomore. 

22. A die is rolled and gives an odd number. 

The roll is greater than 4. 

23. The events E and F are mutually exclusive. Which of the following are mutually 
exclusive pairs? 

(i) E and F. 

(ii) E and E. 

(iii) E and F. 

24. If E and E are two events, write symbolic expressions (using the symbols 
E, F, (T, U, and - ) for the following events. 

(i) E occurs but E does not. 

(ii) E and E both occur. 

(iii) E does not occur. 

(iv) Exactly one of the two occurs. 

25. Suppose E and F are two events. Must E and ( E U F) be mutually exclusive? 


5.2 Unions of Events 

The Rule of Sum 

Suppose S and T are any two sets and you need to list all members of S U T. If you 
list all members of S , then list all the members of T, you will cover all members of 
S U T, but those in S (T T will be listed twice. To count all members of S U T, you 
can count all members of both lists, then subtract the number of duplicates. In other 
words, 

isuri = |S| + \t\ - |snr|. (5.1) 

If S is the set of all the objects in some set that have some property A and T is 
the set of all the objects with property B , then (5.1) expresses the way to count the 
objects that either have property A or property B : 
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(i) count the objects with property A; 

(ii) count the objects with property B\ 

(iii) count the objects with both properties; 

(iv) subtract the third answer from the sum of the other two. 

Sample Problem 5.7. How many positive integers smaller than 100 are rela¬ 
tively prime to 15? 

Solution. Since 15 = 5x3, and 5 and 3 are primes, we need to know how many 
of the positive integers smaller than 100 are neither divisible by 5 nor divisible 
by 3. Write S for the set of positive integers smaller than 100 that are multiples 
of 5, and T for the similar set of multiples of 3. Then S = {5, 10, ..., 95}, so 
|Sj = 19; similarly \T\ = 33. Also S P\ T = {15, 30, ..., 90} has six elements. 
From (5.1), 

\SUT\ = |Sj + \T\ - \S H T\ = 19 + 33 - 6 = 46, 
and the required answer is 99— \S U T\ = 53. 

Another rule, sometimes called the rule of sum, can be simply expressed by say¬ 
ing “the number of objects with property A equals the number that have both prop¬ 
erty A and property B , plus the number that have property A but not property 5”; in 
terms of the sets S and T, this is 

\s\ = |snr| + |s\r|. (5.2) 

The two rules in (5.1) and (5.2) can be combined to give 

\sut\ = |r| + |s\r|. (5.3) 

Sample Problem 5.8. There are 40 students in Dr. Brown \s discrete mathematics 
course and 50 in his calculus section. If these are his only classes, and if 20 of the 
students are taking both subjects, how many students does he have altogether? 

Solution. Using the notations D for the set of students in the discrete class and 
C for calculus, 

\D U C\ = \D\ + \C\ - \D H C\ = 40 + 50 - 20 = 70, 
so he has 70 students. 

Practice Exercise. A survey shows that 12 newspaper readers buy the morning 
edition and seven buy the evening edition; three of these also bought the morning 
paper. How many readers were surveyed? 


5.2 Unions of Events 


123 


Using Venn Diagrams 

The kind of data we have been discussing are very easily represented in a Venn 
diagram. The number of elements in a set is written in the area corresponding to that 
set. 


Sample Problem 5.9. Represent the data concerning Dr. Brown’s students (from 
Sample Problem 5.8) in a Venn diagram. 


Solution. There are four areas in the diagram: the set of students in both classes 
(the center area) has 20 members; the set of students in discrete mathematics 
only (the left-hand enclosed area) has 20 members—subtract 20 from 40; the 
area corresponding to calculus-only students, with 50 — 20 = 30 members, and 
the outside area, which has no members (only Dr. Brown’s students are being 
considered). The diagram is 



Practice Exercise. Represent the data concerning newspaper readers in a Venn 
diagram. 

This method can be used to find the number of elements in the set represented by 
any of the areas or unions of areas, provided there is enough information. 

Sample Problem 5.10. A group of 100 people were surveyed to find out whether 
newspaper advertisements or flyers were more efficacious in advertising super¬ 
market specials. T wenty of them said they pay no attention to either medium. 
Fifty said they read the flyers, and 15 of those said they also check the newspa¬ 
pers. How many use the newspaper advertisements in total? 

Solution. This can be done in various ways. In the obvious notation, |F| = 50 
and \N Pi F\ = 15. Moreover, since 20 of the 100 read neither, \N U F\ = 80. 
Using (5.1), \NUF\ = |V| + |F| — \N D F\ yields 

\N\ = \NUF\- \F\ + \N H F\ = 80 - 50 + 15 = 45. 

Alternatively, we could observe that \F\N\ = 35, so (5.3) is \N U F\ = |V| + 
\F\N\, so 


\N\ = \NUF\- \F\N\ = 80 - 35 = 45. 
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Or we could draw the Venn diagram 



and observe that, in order for the total to add to 100, X = 30, so X + 15 = 45. 


These methods can be applied to three or more sets. 

Sample Problem 5.11. A group of 500 people were asked about their morning 
vitamin intake. It was found that 150 take vitamin B, 200 take vitamin C, 165 take 
vitamin E, 57 take both B and C, 125 take both B and E, 82 take both C and E , 
and 52 take all three vitamins. How many take both B and E, but do not take C? 
How many take none of these vitamins? 


Solution. We start with the diagram 



From the data, v = 52 and u + v = 57, so u = 5. Similarly, w = 73 and y = 30. 
Now t + u + v + w = 150, so t = 20. The other sizes are calculated similarly, 
and we get the diagram 
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The cell corresponding to “B and E but not C” has 73 elements. There are 303 
elements in total, so there are 197 people who take none. 

Practice Exercise. In a survey of 150 moviegoers, it was found that 80 like hor¬ 
ror movies, 75 like police procedurals, and 60 like romances. In total, 35 like both 
horror and procedurals, 25 like horror and romance, and 30 like procedurals and 
romance. Fifteen like all three types of movie. Represent these data in a Venn 
diagram. How many like horror and police procedural movies, but do not like 
romances? How many like romances only? How many like none of these three 
types? 


Inclusion and Exclusion 

Equation (5.1), with S replaced by X , is 

|xur| = |X| + \t\ - |xnr|. 

Now suppose T is itself a union, Y U Z. Then 

|XUXUZ| = |X| + \Y UX| - |Xn(FUZ)|. 

The term | Y U Z | can be replaced by 

|7| + |Z| — |rnz|; 

as a consequence of the distributive laws for intersections and unions of sets, X n 
(Y U Z) = (X IT Y) U (X n Z), and the last term can be written as 

\xhy\ + \xnz\ — \xhy nz\. 

So the whole expression is 

IZUFUZI = \X\ + \Y\ + \Z\ 

-\xnY\-\Ynz\-\xnz\ 

+ \xnYnz\. 

It is instructive to look at the Venn diagram in Figure 5.1. We want to count 
X U Y U Z. If we add \X\ to \Y\ and \Z\, we count all points lying in the lightly 
shaded area twice, and all points in the heavily shaded area three times. Subtracting 
\X (T Y |, \X (T Z|, and \Y D Z\ takes away 1 for each point in the shaded area, but 
takes away 3 for every heavily shaded point. So we must add the number of heavily 
shaded points, which equals \X fl Y Pi Z\. 

This formula can be generalized. The general form is called the principle of in¬ 
clusion and exclusion. 
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Theorem 28. The number of elements in X\ U X 2 U • • • U X n is 

n n n 

E l^ 1 nx i - 2 | + ... 

i\=l i\ — \ i 2 —i\ Tl 

n n n 

+ (-i)*“ 1 E E ••• E \x h nx h n-..nx ik \ 

i\ = 1 ii—h + 1 ik = ik— 1 Tl 

+ --- + (-i)"- 1 |x,' 1 n^n-nxj. 

We shall prove this theorem in Section 5.7. 

Exercises 5.2 

1. Find the number of positive integers that are smaller than 100 and are not divis¬ 
ible by 5 nor by 7. 

2. How many positive integers less than 200 are not divisible by 3 and are not 
perfect squares? 

3. How many elements are there in the set S U 7\ if \S\ =24, |T| = 30, and 

(i) S and T are disjoint? 

(ii) |sn T\ = 13? 

(iii) KH 

4. How many elements are there in the set S Pi T, if \S\ = 14, \T\ = 17, and 

(i) S and T are disjoint? 

(ii) |SU T\ = 21? 

(iii) 5CH 

5. A group of 1,865 voters were surveyed about a new highway. Of them, 805 were 
in favor of financing it with a new tax, 627 favored tolls on the highway, while 
438 said they would vote in favor of either measure. 

(i) How many favor taxes but not tolls? 
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(ii) How many favor tolls but not taxes? 

(iii) How many would vote against either form of funding? 

6. Among 1000 personal computer users it was found that 375 have a scanner and 
450 have a DVD player attached to their computer. Moreover 150 had both de¬ 
vices. 

(i) How many had either a scanner or a DVD player? 

(ii) How many had neither device? 

7. Some shoppers buy bread, butter, and coffee. Ten buy bread, 15 buy butter, and 
14 buy coffee. The number buying both bread and butter, both bread and coffee, 
and both butter and coffee are four, five, and eight, respectively. Three buy all 
three products. How many shoppers were there (assuming all bought at least one 
of the three)? 

8. A survey was carried out. It was found that ten of the people surveyed were 
drinkers, while five were smokers; only two both drank and smoked, but seven 
did neither. Use formula (5.1) to find out how many people were interviewed. 

9. Sixty people were asked about news magazines. It was found that 32 read 
Newsweek regularly, 25 read Time , and 20 read U.S. News and World Report. 
Nine read both Time and U.S. News and World Report , 11 read both Newsweek 
and Time , and eight read both Newsweek and U.S. News and World Report. Eight 
of the people do not read any of the magazines. 

(i) How many read all three magazines? 

(ii) Represent the data in a Venn diagram. 

(iii) How many people read exactly one of the three magazines? 

10. Of 100 high-school students, 45 were currently enrolled in mathematics courses, 
38 were enrolled in science, and 21 were enrolled in geography. There were 18 
in both math and science and nine in both math and geography, while 12 were 
taking geography but neither of the other subjects. There were 23 not currently 
enrolled in any of the three subjects, and five enrolled in all three. Represent 
these figures in a Venn diagram. How many students were taking precisely one 
of the three subjects? How many were taking precisely two? 

11. Eighteen people are interviewed. Of these, seven dislike the Republican party, 
ten dislike the Democrats, and eleven dislike the Reform Party. Moreover, five 
dislike both Democrats and Republicans, five dislike both Republicans and the 
Reform Party, while six dislike both Democrats and the Reform Party. And four 
dislike all three types of politicians, on principle. How many like all three? 

12. Of 100 personal computer users surveyed, 27 use Dell, 35 use Gateway, and 35 
use Hewlett-Packard. Ten of them use both Dell and Gateway, eight use both 
Dell and Hewlett-Packard, and twelve use both Gateway and Hewlett-Packard. 
Four use all three. 
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(i) How many use exactly one of these brands? 

(ii) How many only use other brands? 

13. A shopping mall contains K-Mart, Walmart, and Target stores. In a survey of 
100 shoppers, it is found that 47 shopped at Walmart; 61 shopped at K-Mart; 
52 shopped at Target; 32 shopped at both Walmart and K-Mart; 35 shopped at 
both K-Mart and Target; 22 shopped at both Walmart and Target; 12 shopped at 
all three stores. 

(i) How many people shopped only at Walmart? 

(ii) How many people shopped at Walmart and K-Mart, but not at Target? 

(iii) How many people shopped at exactly one of the stores? 

(iv) How many people shopped at none of the three stores? 

14. A survey of students found that 62 were enrolled in calculus; 71 were enrolled 
in algebra; 67 were enrolled in discrete mathematics; 37 were enrolled in both 
calculus and algebra; 32 were enrolled in both calculus and discrete mathemat¬ 
ics; 40 were enrolled in both algebra and discrete mathematics; 12 were enrolled 
in all three subjects; 44 were enrolled in none of the three. 

(i) How many were enrolled in both calculus and algebra but not in discrete 
mathematics? 

(ii) How many were enrolled in exactly one of the three subjects? 

(iii) How many students were surveyed? 

15. Four hundred researchers at IBM were surveyed about their qualifications in 
computer science. It was found that 300 have at least one degree in computer 
science: 214 have a Bachelor’s degree, 123 have a Master’s degree, and 99 have 
a Ph.D. It was further found that 57 have both Bachelor’s and Master’s degrees, 
74 have both Master’s and a Ph.D., and 22 have both Bachelor’s and Ph.D. de¬ 
grees. 

(i) How many of the researchers have all three degrees? 

(ii) How many came into computer science from another field; that is, they have 
either a Master’s or a Ph.D. in computer science, but no Bachelor’s degree? 

16. How many positive integers smaller than 200 are not divisible by 3, 5, or 7? 

17. Derive the formula 

\A U C\ = |A| + \B nc| + \B nc| 

-\AnBnc\ - |Anfinc|. 
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5.3 One-to-One Correspondences and Infinite Sets 

One-to-One Correspondence 

Two sets, S and T, are said to be in one-to-one correspondence if there is a one-to- 
one function from S onto T. We then write S = T. 

Suppose / is such a one-to-one, onto function. We say the correspondence is 
given by /. For every element s of S, there is exactly one corresponding element 
of T, namely f(s). Moreover / has an inverse, / -1 , and for every element t of T, 
there is exactly one corresponding element of S, namely f~ l {t). So f~ l gives a 
one-to-one correspondence from T to S. In fact we can say more. 

Theorem 29. Suppose U is any set. Then = is an equivalence relation on the power 
set V{U). 

Proof. It is necessary to show that = is reflexive, symmetric, and transitive. 

The reflexive property is easy: for any set S, the identity function is is a one-to- 
one map from S to itself. 

If S = T, with the correspondence given by / : S -> T, then f~ l gives maps 
T = S. Therefore = is symmetric. 

Suppose R = S and S = T: say the two correspondences are given by / : R -> 
S and g : S T. Then g(f) is a one-to-one function from R onto T, so R = T. 
Therefore = is transitive. □ 

Sample Problem 5.12. Show that the set O of all odd integers is in one-to-one 
correspondence with the set E of all even integers. 

Solution. We use the function f(n) = n +1. If n is odd then n +1 is defined and 
is even, so / is a function from © to E. To see that / is one-to-one, we observe 
that ifm + 1 ^ n + 1, then subtracting 1 from each side gives m / n. Every 
even integer arises from an odd integer: m is f(n — 1); so / is onto. (We could 
have shown that / is onto by noting that the inverse function f~ l is defined by 
— n — 1 and may be applied to any even integer.) 

Practice Exercise. Show that the set of all positive integers less than 27 is in 
one-to-one correspondence with the set of letters of the English alphabet. 

The importance of one-to-one correspondences in the theory of enumeration 
comes from the following theorem. 

Theorem 30. Suppose S is a set with n elements, and suppose there is a one-to-one 
correspondence f : S —> T. Then T has n elements. 
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Proof. For convenience, let us write S = {a\, < 22 , ..., a n }. Then T = f(S) = 
{ f(a \), f(ci 2 ), ..., f(a n )} because T is onto. As we have written only n elements 
f(ai), T can have at most n elements. On the other hand, if T has fewer than n ele¬ 
ments, then f{ai ) = f (aj) for some i and j, i / j. But this is impossible because 
/ is one-to-one. So T has precisely n elements. □ 

The usual way in which Theorem 30 is used is as follows. If it is required to find 
the number of elements of a set S', find some set T that is in one-to-one correspon¬ 
dence with S, such that \T\ is already known, or is easy to find. Then |S| = \T\. 


Finite and Infinite Sets 

The idea of a one-to-one onto correspondence is useful in the formal discussion of 
finite and infinite sets. Suppose S is any finite set and T is a subset of S. If there is a 
one-to-one function from S onto T, then Theorem 30 says that T has \S\ elements, 
so T = S. On the other hand, if S is infinite, it is possible to find a proper subset 
T of S and a one-to-one correspondence taking S to T. The simplest example is the 
function f(x) = x + 1, which maps Z* onto Z + , proving that Z* = Z + , even though 
Z + is a proper subset of Z*. The inverse function, = x — l,isa one-to-one 

correspondence from Z + to Z*. 

Sample Problem 5.13. Consider the function f(z ) = z 2 . Show that f is a one- 
to-one correspondence from Z* to the set of integer squares, S = {0, 1,4, 9, 

16,...}. 

Solution. Every perfect square arises from squaring an integer, so / is onto. If 
f(z) = /(*), then z 2 = x 2 , and (as negatives have been excluded) z = x. So f 
is one-to-one. 

Practice Exercise. Show that the function g : g(x) = 2x is a one-to-one corre¬ 
spondence from Z to the set E of even integers. 

Theorem 31. Let A be a finite set and let f be a function from A to itself 

(i) If f is one-to-one, then f is onto. 

(ii) If f is onto, then f is one-to-one. 

Proof, (i) We leave the proof of this part as an exercise. 

(ii) Let A be {a\, 02 , ..., a n ). Since / is onto, /(A) = A, so 
{f(a l ),f(a 2 ),...,f(a n )} = A. 

Suppose f(at) = f(aj) for some i / j, 1 < /, j <n. Then /(A) has at most n — 1 
distinct elements, contradicting the fact that A has exactly n distinct elements. □ 
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Countable Sets 

Among the infinite sets, those that can be put into one-to-one correspondence 
with Z + , the set of positive integers, are called countably infinite , and the others are 
uncountable. Many (but not all) writers also say that finite sets are also countable. 

Sample Problem 5.14. Prove that the set 77 of prime numbers is countably infi¬ 
nite. 

Solution. We know from Theorem 2 that 77 is infinite. The smallest prime num¬ 
ber is 2. Write pi 1) = 2. Similarly, define pin) = 3. In general, define pin) 
to equal the nth prime number. This is well defined even though we know of no 
formula for pin): once the value pin — 1) is known, it is, at worst, necessary 
to test all the integers from pin — 1) + 1 to pit) x p(2) x • • • x pin — 1) for 
primacy (see the proof of Theorem 2). If i / j then pii) / pij). The function 
p is a one-to-one correspondence from Z + to 77. 

Practice Exercise. Prove that the set of odd integers is countably infinite. 

Theorem 32. Every countably infinite set can be put in one-to-one correspondence 
with a proper subset of itself. 

Proof. Suppose S is a countable set, and suppose / is a one-to-one and onto function 
from S to Z + . There will be a unique element of S that is mapped to 1 by /; let us 
call this element z- Now consider the function g, defined by gix) = f~ l (1 + fix)). 
As /, / -1 , and v —> v + 1 are all one-to-one and onto functions, it follows that g 
is one-to-one and onto. It carries S to T = S\{z}, the set of all elements of S other 
than z. So S is in one-to-one correspondence with its proper subset T. □ 

It may be shown that every infinite set contains a countably infinite subset. To 
prove this we need to use some advanced set theory, including the Axiom of Choice, 
which lies outside the scope of this book. The argument goes roughly as follows. 

Suppose A is any infinite set. Select an element—call it a\ —of S. Write Si to 
denote the set {a\}, and write Ai for the result of deleting a\ from S: A\ = S\{< 2 i}. 
Ai is again infinite. Now select an element a 2 in Ai, and write S 2 = {a\, 02 } and 
A 2 = Ai\{^ 2}- Proceed in this way: when S n and A n have been determined, then 
select a n+ \ in A n , and write S n +\ = S n U {a n+ \} = {a\, a 2 , ..., a n + 1 } and A n+ i = 
A n \{a n + 1 } = A\S n +i. It can be seen that this process defines an infinite subset 

S = [a\, &2, • • •}, 

of A, and S is clearly countably infinite because the function / : /(/) = <Zj is a 
one-to-one correspondence between Z + and S. 

(The problem, and the need for an axiom, comes because the definition of S 
requires an infinite number of steps. The situation is rather like the one for which the 
Principle of Mathematical Induction had to be introduced.) 
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Now suppose A is an infinite set and S is a countably infinite subset of A. Select 
a proper subset T of S that is in one-to-one correspondence with S ; say the function 
g : S ^ T is one-to-one and onto. Define B = (A\S) U T, the set obtained by 
deleting from A all those elements of S that are not in T. Then B is a proper subset 
of A, and the function /, defined by 

if s e S, 
if s e a\S , 

is a one-to-one correspondence from A to B. 

So any infinite set is in one-to-one correspondence to a proper subset of itself. 

This, in fact, characterizes infinite sets. We could say that a set is infinite if and 
only if it is in one-to-one correspondence with some proper subset of itself. 

Exercises 5.3 

1. Suppose |A| = m and \B\ = n and suppose C c A x B. What is the greatest 
value possible for | C | if C is the set corresponding to a function? 

2. If / : A -> B and \A\ < \B\, can / be onto B1 

3. Let 15*1 = 2. How many functions are there from S to SI How many onto func¬ 
tions? How many one-to-one functions? 

4. Repeat the preceding exercise in the case where \S\ —n. 

5. Show that the elements of A x B are in one-to-one correspondence with the 
elements of B x A. 

6. Prove that / : v —> 2x is a one-to-one correspondence from {x : 0 < v < 1} to 
{x : 0 < x < 2}. 

7. Find a one-to-one correspondence between the set of all nonnegative real num¬ 
bers and the set of all real numbers. 

8. Prove Theorem 31 (i). 

9. Say |A| = m, \B\ = n. What can you say about the following orders? 

(i) |A U 5|, (ii) |A n B\, (iii) \A\B\, (iv) \A + B\. 

10. Suppose S is a countable set and T is a finite set. Prove that S U T is countable. 

11. Suppose S and T are countable sets. Prove that S U T is countable. 

12. If A and B are uncountable sets, is A U B always uncountable? 

13. If A and B are uncountable sets, is A D B always uncountable? 
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5.4 Arrangement Problems 

Sequences on a Set 

Suppose S' is a set with s elements. We often need to know how many ordered sets 
or sequences of size k (often called k-sequences , or permutations of length k) can 
be chosen from S. This number is denoted (s)k or P(s, k). In particular, ( s) s de¬ 
notes the number of s-sequences that can be chosen from an s-set, which is the same 
as the number of different ways of ordering an s-set; it is given the special name 
“s factorial,” and written si. These ^-sequences are called permutations of S. 

Given an s-set S = {x\, X 2 , ..., x s ], there are s different sequences of length 1 
on S, namely (jq), (* 2 ),..., and (x s ). So (s) 1 = s. There are s-(s — 1) sequences 
of length 2 because each sequence of length 1 can be extended to length 2 in s — 1 
different ways, and no two of these s • (s — 1) extensions will ever be equal. So 
(s') 2 = s(s — 1). Similarly we find 

(s) 3 =s-(s-l)-(s-2), 

( 5)4 = s-(s — l)-(s — 2)-(s — 3), 


(s) k =s-(s - 1 )-(s - 2)---(s -k + 1). 

So (s)k is calculated by multiplying, s,s — l, s — 2, ..., until there are k factors. 

It follows that 5 ! = s-(s — 1 )-(s — 2) • • • 3-2 -1 and that 

(s)k = s\/(s — k)\. (5.4) 

For convenience we define 0! to equal 1. 

Sample Problem 5.15. Hat are 10! and (10)3 

Solution. 10! = 10-9-8-7-6-5-4-3-2.1 = 362880. 

There are two ways to calculate ( 10 ) 3 . We could say ( 10)3 = 10-9-8 = 720. Or 
we could use the formula ( 10)3 = 10!/7! = 362880/5040 = 720. The first way 
is easier. 

Practice Exercise. Find 6 ! and ( 6 ) 4 . 

Sample Problem 5.16. Find the number of different ways of arranging n women 
around a circular conference table. 

Solution. Suppose we arbitrarily label one seat at the table as “1,” the one to its 
left as “2,” and so on. There are n\ different ways of putting the n women into 
the n seats. However, we have counted two arrangements as different if one is 
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obtained from the other by shifting every woman one place to the left because 
these two arrangements put different women in “1but these are clearly the same 
arrangement for the purposes of the question. Each arrangement is one of a set 
of n , all obtained from the others by shifting in a circular fashion. So the number 
of really different arrangements is n\/n, which equals ( n — 1)!. 

Practice Exercise. Find the number of different ways of arranging n men in one 
straight row of chairs. 

Sample Problem 5.17. A committee of three — Chair, Secretary, and Treasurer — 
is to be elected by a club with 14 members. If every member is eligible to stand 
for each position, how many different committees are possible ? 

Solution. We can treat the committee as an ordered set of three elements chosen 
from the 14-element set of members. So the answer is (14)3, or 2184. 

Practice Exercise. What is the number of possible committees if there are 
12 members? 


Combining Arrangements with the Multiplication Principle 

The methods used here are often combined with the multiplication principle. We 
illustrate with several examples. 

Sample Problem 5.18. Three boys and four girls are to sit along a bench. The 
boys must sit together, as must the girls. How many ways can this be done ? 

Solution. First, decide whether the boys are to be on the left or on the right. This 
can be done in two ways. Then order the boys—there are 3! = 6 ways. The girls 
can be ordered in 4! = 24 ways. So there are 2-6-24 = 288 arrangements. 

Practice Exercise. How many ways could the three boys and four girls be ar¬ 
ranged around a circular table if the boys must sit together and the girls as well? 

Sample Problem 5.19. A copying company has eight photocopying machines 
and seven employees who can operate them. There are four copying jobs to be 
done. How many ways are there to allocate these jobs to operators and ma¬ 
chines ? 

Solution. Call the jobs A, B, C, D. Choose two arrangements: first, which four 
operators should do the jobs; second, which machines should be used. The oper¬ 
ator choice can be made in P(7, 4) ways, and the machines in P( 8, 4) ways. In 
each case, the first member of the sequence is the one allocated to job A, the sec¬ 
ond to job B , and so on. There are P(7, 4) x P(8, 4) = 7 x 6 x 5 x 4 x 8 x 7 x 6 x 5 
= 1,411,200 ways. 
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Sample Problem 5.20. A high school class contains 10 boys and 13 girls. They 
have boys’ and girls’ charity drives and wish to elect a Chair and Treasurer for 
each. How many combinations are possible ? 

Solution. There are (10)2 ways to select the boys’ committee, and (13)2 ways 
to select the girls’ committee. So the total is (10)2■ (13)2 = 90-156 = 14,040. 

Practice Exercise. What is the answer if each committee also has a Secretary? 

Sample Problem 5.21. The club in Sample Problem 5.17 wish to elect a by-laws 
committee with three members — Chair, Secretary, and Legal Officer—and require 
that no members of the main club committee should be members. In how many 
different ways can the two committees be chosen? 

Solution. Suppose the main committee is chosen first. There are (14)3 =2184 
ways to do this. After the election, there are 11 members eligible for election to 
the by-laws committee, so it can be chosen in (11)3 = 990 ways. So there are a 
grand total of 2184-990 = 2,162,160. 

In the last Sample Problem we see that, even in small problems, the numbers get 
quite large. It is better to report the answer in its factored form, assl4-13-12-ll-10-9. 
This form of answer also makes it clear that we could have solved the problem by 
treating the two committees as one six-member sequence, with (14)6 possible solu¬ 
tions. 

Even a small alteration to a problem can make it considerably more complicated. 
For example, suppose that one member of the club, Dr. Jones, decides she does not 
wish to stand for the by-laws committee, although she is happy to be on the main 
committee. In each case where Dr. Jones is a member of the main committee, there 
are ( 11)3 possible by-laws committees, but in the other cases there are only ( 10)3 
possibilities. To proceed, it is necessary to know how many of the possible main 
committees contain Dr. Jones and how many do not. There are (13)3 committees 
without Dr. Jones, so there are ((14)3 — (13)3) of which she is a member. So the total 
required is 

[(13) 3 *(10) 3 ] + [((14)3 - (13) 3 )]-(11>3 
= [(13-12-11)-(10-9-8)] + [(14-13-12) - (13-12-ll)]-(ll-10-9) 

= [1716-720] + [2184 - 1716]-990 
= 1,235,520 + 463,320 = 1,698,840. 


Arrangements with Repetitions 

If repeated elements are allowed, the number of sequences that can be formed from a 
set is far larger. If there is no restriction, then s k ^-sequences can be formed from an 
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k 

(6 )* 

6 k 

( 6 ) 4 / 6 * 

1 

6 

6 

1.00000 

2 

30 

36 

0.83333 

3 

120 

216 

0.55556 

4 

360 

1296 

0.27778 

5 

720 

7776 

0.09259 

6 

720 

46656 

0.01543 


Table 5.1. Numbers of sequences without and with repetitions 


s-set. To show just how great this difference is, even in small cases, Table 5.1 shows 
the relative values of (6)& and 6 k . 

In some cases, repetitions are allowed but limited. Problems of this kind can be 
modeled by talking about sets with a certain number of copies of each element. For 
example, if you want to count the number of sequences of length 5 based on the 
set {A, B} that contain no more than three A ’s and no more than four B’ s, you could 
instead talk about five-sequences selected from the collection {A, A, A, B, B, B, BJ. 

To avoid the problems that can arise from having multiple copies of the same 
element, we often talk of distinguishable and indistinguishable elements. For exam¬ 
ple, many problems can be modeled in terms of selecting marbles from an urn. The 
usual convention is that two sequences are distinguishable only if they differ in color 
sequence: any two blue marbles are indistinguishable. 

One way to tackle these problems is to assume the “indistinguishable” objects can 
be distinguished, and then take this into account. For example, consider the letters in 
the word ASSESS. In how many distinguishable ways can you order these letters? 

Suppose the letters were written on tiles with numbers as subscripts, like Scrabble 
tiles. Label them so that no two copies of the same letter get the same subscript, 
for example A 1 S 1 S 2 E 1 S 3 S 4 . Then all the six letters are different, and there are 6! 
orderings. Say you have each of these orderings written on slips of paper. 

Now collect together into one pile all the slips that differ only in their subscripts. 
For example, A\E\ S\ S 2 S 3 S 4 and A 1 E 1 S 2 S 1 S 3 S 4 will be in the same pile, as will 
A 1 E 1 S 2 S 3 S 1 S 4 , A 1 E 1 S 4 S 2 S 3 S 1 , and several others. In fact, we can work out how 
many slips there are in a pile. There are four letters S , and one each of the others. 
Two slips will be in the same pile when they have the letters in the same order, but 
the subscripts on the S 's are in different order. There are 4! = 24 ways to order the 
four subscripts, so there are 4! slips in each pile. Therefore there are 6!/4! = 30 piles. 

Two orderings can be distinguished if and only if their slips are in different piles, 
so there are 6!/4! = 30 distinguishable orderings of ASSESS. 

The same principle can be applied with several repeated letters. For example, if 
SUCCESS is written as Si U\ C\ C 2 E\ S 2 S 3 , we see that there are 2! ways of ordering 
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the C’s and 3! ways of ordering the S' s, so each pile will contain 3! -2! slips, and the 
number of distinguishable orderings is 7!/(3! -2!) = 420. 

Sample Problem 5.22. In how many distinguishable ways can you order the let¬ 
ters of the word MISSISSIPPI? 

Solution. There is one M, four /’s, four S' s, and two P's, for a total of 11 letters. 
So the number of orderings is 11 !/(4! *4! *2!) or 34,650. 

Practice Exercise. In how many distinguishable ways can you order the letters 
of the word BAN AN A? 

The technique we have been using can be generalized to the following theorem. 

Theorem 33. The number of distinct permutations of n objects of r types, with k\ 
objects of the first type, ki of the second, and so on, where two objects of the same 
type are considered identical, is ^ , . 

Proof. Apply subscripts to all objects of a given type, so that there are now n distinct 
objects. There are n\ ways to arrange these objects. Now divide by k\ \ to enumerate 
the ways in which the objects of the first type might be rearranged, by ! to rearrange 
objects of the second type, and so on. The result follows. □ 

To represent the fraction arising in Theorem 33 we often write 

( n \ n\ 

k x ,k 2 ,...,k r ) = 

similar to the notation when all elements are distinguishable. 

A set in which some members are indistinguishable (like the sets of letters in 
the word SUCCESS, provided we do not add the subscripts) is called a multiset. 
The set consisting of one element of each type is called the underlying set. If there 
are r distinguishable types of elements, and there are k\ elements of type 1, k 2 ele¬ 
ments of type 2, ..., and k r elements of type r, we say the set is a multiset of type 
(k\, & 2 , ..., k r !). For example, {A, A, A, B, C, C, C} is a multiset of type (3, 1, 3). 
This particular set can also be denoted {3 • A, 1 • B, 3 • C}, and can be ordered in 
( 3 \ 3 ) = 3 |Y| 3 y = 140 ways. The underlying set {A, B.C} has only six orderings. 

It is also possible to have a multiset in which there can be an infinite number of 
elements of one or more of the distinguishable types. For example, if a set contains 
an infinitude of elements indistinguishable from A, an infinitude of elements indis¬ 
tinguishable from B, and three elements indistinguishable from C, we would write 
it as {00 • A, 00 • B, 3C} and say it is of type ( 00 , 00 , 3). 

The problems are significantly harder if not all the available repetitions are used. 
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Sample Problem 5.23. An experimenter has an urn containing 12 marbles: five 
red, two blue, and five green. Assuming that marbles of the same color are indis¬ 
tinguishable, how many different sequences of length 4 can be chosen from the 
set? 

Solution. If unlimited numbers of each color were available, this would be the 
same as the problem of selecting sequences of length 4 from a three-set with un¬ 
limited repetitions allowed, and there would be 3 4 = 81 solutions. It is necessary 
to exclude those with three or four blue marbles. In the obvious notation, these 
are BBBR , BBRB , BRBB , RBBB , BBBG , BBGB , BGBB , GBBB , and 
BBBB , nine in total. So the answer is 81 — 9 = 72. 

Practice Exercise. What is the answer if there are four red, three blue, and five 
green marbles? 


Exercises 5.4 

In Exercises 1 to 12, calculate the indicated quantity. 


5!. 

2. ( 8 ) 3 . 

3. (4) 4 . 

9!. 

5. (5) 4 . 

6. (9) 2 . 

8 !. 

8. ( 8 ) 5 . 

9. (4)i. 

4!. 

11. (5)2. 

12. (7) 2 . 


13. Ten midsize cars are available for rental. Three customers arrive and each 
chooses a midsize car. In how many different ways can the choice be made? 

14. A palindrome is a “word” (any string of letters) that reads the same forward or 
backward, such as CIVIC or A ABA A. How many five-letter palindromes are 
there (using the ordinary, 26-letter alphabet)? 

15. There are 13 contestants. In how many different ways can the judges award first, 
second, and third prize? 

16. How many ways are there of seating six people at a round table? 

17. How many ways are there of seating six people at a round table so that two 
specific people sit together? 

18. Five stereo systems are to be arranged in a line against the wall of the appliance 
department. 

(i) How many ways can this be done? 

(ii) How many ways can they be arranged if the most expensive model must be 
in the middle? 
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19. Three men and four women sit in a row. How many different ways can they do 
it if the following conditions apply. 

(i) The men must sit together? 

(ii) The women must sit together? 

20. The ACME company uses serial numbers consisting of three letters followed by 
five digits. 

(i) How many possible serial numbers are there? 

(ii) How many possible serial numbers are there if no digit can be repeated? 

(iii) How many if no repetitions of digits or letters are allowed? 

21. John likes to arrange his books. He has four western, five mystery, and six 
science-fiction books, all different. 

(i) In how many ways can he arrange them on a shelf? 

(ii) In how many ways can he arrange them if all the books on the same subject 
must be grouped together? 

In Exercises 22 to 29, how many ways can you arrange the letters of the word? 


22. 

BORROW. 

23. 

PROFESSOR. 

24. 

MOOSEWOOD. 

25. 

ACCESSORY. 

26. 

OFFERED. 

27. 

RUBBLE. 

28. 

BOOKKEEPER. 

29. 

DENDRONOID. 


30. A bookseller has four copies of Hearts in Atlantis , five copies of Bag of Bones, 
and three copies of Needful Things in her Stephen King section. Copies of the 
same book are identical. In how many different ways can she arrange them? 

31. Prove that (2 n)\/n\ = 2^(1 x 3 x 5 x • • • x (2 n — 1)), and consequently that, 
for any positive integer n, 

2 x 6 x 10 x • • • x (4 n — 6) x (4 n — 2) 

= (n + 1) x (n + 2) x • • • x (2 n — 1) x 2 n. 

32. Write out a formal inductive proof of the value of (s)k- 

33. You have n different gemstones, and you want to string them together to make a 
necklace. Show that (n — l)!/2 different necklaces are possible. 

34. You have a deck of nine cards: three (identical) aces of spades, three (identical) 
kings of hearts, and three (identical) queens of clubs. In how many different 
ways can you deal a sequence of four cards? 
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Selections 


Given a set S, we are often interested in knowing how many different subsets of a 
given size are contained in S. This depends only on the size of S. We shall write (£) 
or C(s, k) for the number of k-subsets of an s-set; it is usual to read the symbol as “s 
choose kT 


We can use the formula in (5.4) to derive expressions for the numbers C(s, k). 
Suppose S' is a set with s elements. It is clear that every &-set that we choose from 
S gives rise to exactly k! distinct k-sequences on S and that the same k-sequence 
never arises from different k-sets. So the number of k-sequences on S is k! times the 
number of k-sets on S, or 


_ (£)t _ s! 

k\ ~ (s — k)\k\ 


(5.5) 


When calculating (£) in practice, you would usually calculate (s)k, then divide 
by k\. So 

/s\ s • (s — 1) • (s — 2) • • • (s — k + 1) 

W = 1-2-3 • • - k * 


There are k factors in the denominator and in the numerator. 

From the definition of 0! we get (q) = 1, which is quite consistent. It is possible 
to choose no elements from a set, but you cannot imagine different ways of doing it. 
It will be convenient to define (£) = 0 if k > s (again this makes sense; there is no 
way to choose more than s elements from an s-set). 


Sample Problem 5.24. Calculate C(8, 4) and (^). 


Solution. 


C(8, 4) 



8-7-6-5 
4-3-2-1 
6 ! 


70, 


0 ! * 6 ! 


= 1 . 


(There is no need for calculation: the terms 6! in the numerator and denominator 
cancel out.) 


Practice Exercise. Calculate C(9, 5) and (q). 

Sample Problem 5.25. A student must answer five of the eight questions on a 
test. How many different ways can she answer, assuming there is no restriction 
on her choice ? 






Solution. ( 1 ?) =56 ways. 
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Practice Exercise. How many ways can she answer if she must choose five, one 
of which is Question 1 ? 

Sample Problem 5.26. Recall from Section 1.5 that computers read strings of 
binary digits, with every entry 0 or 1. How many 8-bit strings (strings of eight 
binary digits) are there that contain exactly five 1 ’s? 

Solution. To specify a string, it is sufficient to say which positions have l’s. 
There are C( 8 , 5) choices, so the answer is C( 8 , 5) = 56. 

Practice Exercise. How many 8 -bit strings contain exactly four l’s? 

Sample Problem 5.27. How many ways can a committee of three men and two 
women be chosen from six men and four women? 

Solution. The three men can be chosen in ( 3 ) ways; the two women can be 
chosen in (*) ways. Using the multiplication principle, the total number of com¬ 
mittees possible with no restrictions is 



= 120 . 


Practice Exercise. You wish to borrow two mystery books and three westerns 
from your friend. He owns five mysteries and seven westerns. How many differ¬ 
ent selections can you make? 

Sample Problem 5.28. How many different “words ” of five letters can you make 
from the letters of the word 

REPUBLICAN , 

if every word must contain two different vowels and three different consonants? 

Solution. The three consonants can be chosen in ( 3 ) = 20 ways, and the vowels 
in ( 2 ) = 6 ways. After the choice is made, the letters can be arranged in 5! = 120 
ways. So there are 20-6 -120 = 14, 400 “words.” 

Practice Exercise. What is the answer if you use the word 


DEMOCRAT ? 
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An Illustrative Example 

The following example illustrates two important facts. First, the numbers that arise 
in selection problems can be very large—sometimes the intermediate steps are much 
greater than the answer. And second, there are often two (or more) ways to attack the 
same problem. 

Suppose the 16 members of the girls’ track team are to be divided into four teams 
of four members each to train for the relay. Assuming that you do not care who runs 
in which position, how many ways can they be divided? 

Start by arbitrarily ordering the four teams I, II, III, IV. The members of Team 
I can be chosen in C(16, 4) ways. After they are chosen, there are 12 girls left, so 
there are C(12, 4) ways to choose Team II. Team III can be chosen in C( 8 , 4) ways. 
Then Team IV is decided (there are only four girls left). We have 

C(16,4)-C(12,4)-C(8,4), 

arrangements. But the order of the teams did not really matter, so every arrangement 
is one of 4! that all give the same solution. (4! is the number of ways of arbitrarily 
assigning the labels I, II, III, IV to the four teams.) So the answer is 

C(16,4)-C(12,4).C(8,4) _ 16! 12! 8 ! 1 16! 

4! ~ 12!*4! X 8b4! X 4b4! X 4! ” ( 4 !)^ 

(the 12 !’s and 8 !’s cancel). 

Equation (5.5) was useful because of the way the factorials 12! and 8 ! canceled. 

If you try to work out the exact answer to the problem, you will find that 16! is 
greater than 2-10 13 . On many calculators this large number will only be represented 
approximately, and the divisions will give you at best an approximate answer. But if 
you write out 

16-15-14-13-12-1 MO-9-8-7-6-5-4.3-2-1 
(4-3-2 -1)-(4-3-2 - l)-(4-3-2-l)-(4-3-2-1)-(4-3-2 -1) ’ 

and cancel you will get 15-7-13 -11-5-7-5, which comes to 2,627,625. This selection 
problem involved intermediate numbers nearly a million times as great as the solu¬ 
tion. If, instead of canceling the 12! and 8 ! we had calculated the three C -numbers, 
the multiplications would have been very large. 

As we promised, there is another way to handle this problem. This time select 
one girl at random. There are 15 other girls from whom her three teammates can be 
chosen, so this can be done in (* 3 5 ) ways. For each possible choice here, suppose you 
select one of the twelve remaining girls. Her team can be completed in ( 3 1 ) ways. 
Again choose a girl from those remaining; her team can be completed in Q ways 
and the remaining girls must make up the other team. The number of choices is 

15^ fll\ p\ _ 15-14-13 11 • 10-9 7-6-5 

3/ X \3/ X \3/ _ 3-2-1 X 3-2-1 X 3-2-1 ’ 











5.5 Selections 


143 


which comes to the same answer. This time the intermediate numbers were consid¬ 
erably smaller. 

Problems Combining Different Methods 

Some problems can be solved by combining the methods we have been using with 
the rule of sum, which was given in Section 5.2 as 


\s\ = |sn r| + \s\t\. 


(5.2) 


Sample Problem 5.29. In Sample problem 5.27, how many ways are there to 
select the committee if one particular couple, say Mr. and Mrs. Smith, do not 
wish to be on the committee together? 

Solution. The total number of committees possible with no restrictions was 120. 
If Mr. and Mrs. Smith are both on the committee, then the rest of the committee is 
found by choosing two of the five remaining men and one of the three remaining 
women. This can be done in 



ways. So 30 of the 120 possible committees contain both Smiths; when these are 
excluded there remain 90 ways of choosing the committee. 

(We have, in fact, applied the rule of sum: interpret S as the set of all possible 
committees, and T as the set of committees not containing both Smiths. Then we 
found \S\ = 120 and \S D T\ = 30, we required \S\T\, which the rule tells us 
equals 90.) 

Practice Exercise. A group contains three men and seven women. In how many 
ways can a committee of three people be chosen if it can contain no more than 
one man? 

Sometimes both arrangement and selection techniques are applied to the same prob¬ 
lem. 

Sample Problem 5.30. A copying company has eight photocopying machines 
and seven employees who can operate them. There are four identical copying 
jobs to be done. (There are 4000 booklets to be made; each person makes 1000 
copies.) How many ways are there to allocate these jobs to operators and ma¬ 
chines? 


Solution. We shall give two solutions to this problem. 

(i) This is like Sample Problem 5.19, but the jobs are now identical. In that prob¬ 
lem there were P(7, 4) x P(8, 4) arrangements. As the jobs are indistinguishable, 
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the ordering of A , B, C, D is irrelevant, so we divide the number of arrangements 
by 4!. The answer is 


(7-6.5-4) x (8-7.6-5) 
4-3-2.1 


P(7,4) x P(8, 4) 
4! 


which works out to 58,800. 

(ii) There are C(7, 4) ways to choose four operators and C(8, 4) ways to choose 
four machines. When the choice is made, there are 4! ways to assign the four 
workers to four machines. So the answer is 



the same as before. (There is an extra factor 4! in the numerator and in the de¬ 
nominator.) 

Symmetry of the Choice Function 

From (5.5) we see that 



(5.6) 


This equality can also be derived as follows. If we wish to choose k things from a set 
of s, we could just as easily say which k — s will not be included. In other words, to 
specify a subset of S , we could just as easily specify its complement (in the universal 
set S). 

Sample Problem 5.31. Calculate (^g 0 ). 

Solution. From (5.6), 



( O 1 V 

79). 


Pascal’s Triangle 


Theorem 34. For all integers k and s such that 1 < k < s, 
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Proof. Suppose S is an s-set, and suppose v is one particular member of S. Each 
subset of S falls into one of two categories: those that do contain x, and those that do 
not contain v. So we calculate the number of &-sets on S that contain x, and then the 
number that do not. Adding these numbers together, we get the total number of sets. 

The £-sets on S that contain v each have k — 1 other elements, and we get a 
different &-set whenever we choose a different (k — l)-set from those other objects. 
So their number equals the number of (k — l)-sets that can be chosen from S\{v}, 
namely (Jr\)- 

The £-sets on S that do not contain v are precisely the £-sets of and there 

are ( 5 ^ 1 ) of them. Adding, we get the result. □ 

Theorem 34 can be used to generate a table of values of the numbers (£). For 
convenience, we call the top row of the table row 0. In row s we write the values of 



in that order. We write the table in a triangular form: Q occurs a half space to the 
left of (^q 1 ). So the table is as shown. Observe that (£) lies in the table with ( 5 ^ 1 ) 
and ( 5 ^ 1 ) above it, a half-step to the left and a half-step to the right, respectively. So 
Theorem 34 tells us that every term can be constructed by adding together the two 
numbers above it 



This table is called Pascal’s triangle. When the values are substituted in we get 
the following array 

1 

1 1 

1 2 1 
13 3 1 

1 4 6 4 1 

1 5 10 10 5 1 

6 15 20 15 


1 


6 


1 
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Selections from Multisets 

We now look at some problems where we wish to divide up n identical objects into 
k distinct classes. These turn out to be equivalent to problems involving selections 
from multisets (that is, sets with some indistinguishable members). 

Sample Problem 5.32. We are given 12 identical coins and wish to distribute 
them into four distinct boxes. In how many ways can we do this? 

Solution. Suppose we have a distribution. We could then line up the 12 coins 
on a table and insert three markers between coins to indicate which coins go 
into which box. Putting O for a coin and | for a marker, the following diagram 
indicates two coins in the first box, four in the second, and three each in the third 
and fourth: 


oo\oooo\ooo\ooo , 

and if all 12 coins were to go into the first box, the diagram would be 

oooooooooooo\\\. 

Each vertical divider separates the coins that go into one box from the coins 
that go into the next. Each diagram corresponds to a distinct way to arrange the 
coins, and each arrangement of coins is represented by exactly one diagram. So 
the number of diagrams equals the number of ways to arrange the coins. 

There are 12 O’s and three |’s, or 15 symbols altogether, and three of them are |. 
So we choose three places from the 15 to place the dividers. This can be done in 
( 3 5 ) = 455 ways, so this is the number of ways to distribute the coins. 

Practice Exercise. In how many ways can we do this if each box is to receive at 
least one coin? 

The Sample problem generalizes to the following theorem. 

Theorem 35. n identical objects can be partitioned into k distinct classes in 
ways. 

Proof. We create a string of length n + k — 1 consisting of n objects and k — 1 
dividers. The objects that go into class i will be those between divider i and divider 
i +1. Because there are C(n + k — 1, k — 1) such strings, there are C(n + k — 1, k — 1) 
ways to partition the objects. □ 

Theorem 36. If a multiset S contains k types of elements A\, A 2 , ..., A&, and in¬ 
finitely many of each type, then there are 
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fn+k — 1\ fn+k— 1\ 
n J = V *- 1 /’ 

distinguishable ways to select n elements from S. 

Proof. Consider a partition of n identical objects into k distinct classes. We use the 
partition to construct a subset from S. If there are n, elements in the ith class, then the 
subset contains n z members of type A*. Clearly two subsets will be distinguishable if 
and only if the corresponding partitions are different, so the number of distinguish¬ 
able subsets that can be selected equals the number of different partitions, that is, 

(■+') = ttV)- □ 

Sample Problem 5.33. You want to select a dozen donuts from a store. They 
make four kinds that you like: chocolate, caramel, sugar-dusted, and plain. There 
are at least 12 of each kind in stock. How many different selections can you make? 
(Donuts of the same kind are considered indistinguishable.) 


Solution. As there are enough donuts of each kind to make a selection of that 
kind only, the answer will be the same as if the numbers available were infinite. 
So n = 12, k = 4, and the answer is ( 12+4_1 ) = ( 3 5 ) = 455. 

Practice Exercise. You want to select workbooks to hand out to your 14 stu¬ 
dents. The workbooks are identical except for the color of the cover; five colors 
are available. How many different choices are available? 

Sample Problem 5.34. Suppose the donut store has only three caramel donuts 
in stock. How many different selections can you make now? 


Solution. You can choose t caramel and 12 — t from the three other sorts, for 
t = 0, 1,2, 3. So 



— t) + 3 — 1\ 
3-1 ) 



= 91 +78 + 66 + 55 = 290. 


Theorem 35 can also be applied to solutions of equations. 

Theorem 37. The number of nonnegative integer solutions to the equation 

x\ +X 2 H- \-Xk = n, 


is 


n + k — 1 
k — 1 
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Proof. The solution (x\ , X 2 , ..., Xk) corresponds to a partition of a set of n identical 
objects into k distinct classes: jq is the number of elements in the i th class; two 
partitions will be different if and only if the strings (x\ , X 2 , ..., Xk) are different. □ 

Sample Problem 5.35. Find the number of nonnegative integer solutions to x\ + 
*2 + *3 + *4 = 11 - 


Solution. k = 4, n = 11 , so there are ( 11 4 ^ 4 1 *) = (^) = 364 solutions. 

Practice Exercise. Find the number of nonnegative integer solutions to x\ +* 2 + 
x% T - X4 T - X 5 = 13. 


The theorem can also be applied to cases where the restrictions on the variables 
are a little more general. 

Sample Problem 5.36. Find the number of integer solutions to x\ + X 2 + X 3 + 
X 4 = 20 where x\ > 2, X 2 > —2, X 3 > 3 and X 4 > 5. 


Solution. Define y\ = x\ — 2, y2 = X2 + 2, ^3 = *3 — 3, ^4 = X4 — 5. Then 
yi+>2 + y3 +>4 = 20-2 + 2- 3- 5= 12; there are ( 12 +_ 4 f 1 ) = (‘ 3 5 ) = 455 
solutions. 

Exercises 5.5 

In Exercises l to 12, calculate the indicated quantity. 

1. C( 8 , 3). 2. C(7,7). 3. C(9,4). 4. ( 5 ). 

5. (3). 6 . (®). 7. C(7,3). 8 . C(10,2). 

9. C(9,7). 10. (®). 11. 0- 12. (®). 

13. The math department wishes to select four of its 16 members to teach the discrete 
math course. In how many ways can this selection be made? 

14. A test has ten questions, five in part A and five in part B. 

(i) A student has to choose five questions, two from part A and three from 
part B. How many ways can she make her choice? 

(ii) How many ways can she make her choice if she must choose five questions, 
at least two from each part? 

15. The Student Council consists of 6 juniors and 12 seniors. A committee of two 
juniors and three seniors is to be formed. How many ways can this be done? 

16. A regular deck of cards contains 52 cards: 13 spades, 13 hearts, 13 diamonds, 
and 13 clubs. A five-card hand is dealt. 

(i) How many hands contain only spades? 
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(ii) How many hands contain three spades and two clubs? 

(iii) How many hands contain only spades and clubs, at least one of each? 

17. How many 12-bit binary strings with five l’s are possible? How many of them 
start 101? 

18. There are 12 appetizers on the menu, five cold and seven hot. How many ways 
can you choose four different appetizers for your table? In how many cases does 
this include two hot and two cold appetizers? 

19. Ten points A, B, J in the plane are selected, no three of them collinear. How 
many triangles can be constructed with three of these points as the vertices? How 
many of the triangles have A as one vertex? 

20. A club with 24 members wants to elect a committee consisting of President, 
Secretary, and three Ordinary Members. How many committees are possible? 

21. A state lottery requires you to choose five different numbers from {1,2,..., 49}. 

(i) How many possible choices are there? 

(ii) The state then draws six different numbers. You win if all five of your num¬ 
bers are chosen. How many of your possible choices of five numbers will 
make you a winner? 

(In both cases, do not worry about the order in which the numbers are chosen.) 

22. The motor pool has eight drivers and nine vehicles. Four drivers must be assigned 
to four vehicles for today’s duty. How many ways can it be done? 

23. It is required to select, from the set of numbers {1,2, 3, 4, 5, 6, 7, 8}, a subset 
of four that will contain either 1 or 2, but not both. How many selections are 
possible? 

24. Suppose a class of r boys and s girls wishes to choose a committee with k mem¬ 
bers. 

(i) Show that the total number of committees possible is 

r+s\ 

k y 

(ii) Suppose there are to be t boys on the committee. Show that the number of 
possible committees is 



provided 0 < t < k. 

(iii) Use these results to prove that 
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25. Two lines are drawn on the plane, with intersection point A. From one line m 
points are selected, and from the other n are selected, but A is not in either set. 
How many triangles can be formed with vertices among the m + n points? How 
many can be formed from the m + n + 1 points if A is added to the set? 

26. Prove the following identities algebraically. 

(i) = n( n k Z\) whenever 1 < k < n. 

(ii) k(k — 1 )(£) = n(n — l)^!^) whenever 2 < k < n. 

27. (i) It is required to select a committee with k members from a group of n can¬ 

didates, and to select a chairman from the committee. Calculate the number 
of ways this can be done by two different methods. 

(a) First find the number of ways that a committee can be chosen, and 
multiply by the number of ways the chairman can then be selected. 

(b) First find the number of ways that the chairman can be selected, and 
multiply by the number of ways the rest of the committee can be chosen. 

Use the fact that (a) and (b) must give the same answer to prove the identity 
in Exercise 26(i). (We call this a combinatorial proof of the identity.) 

(ii) Find a combinatorial proof of the identity in Exercise 26(ii). 

28. Prove Theorem 34 by induction. 

29. Prove that 



30. Construct the first ten rows of Pascal’s triangle. 

31. You want to buy six donuts from the donut store in Sample Problem 5.33; again, 
there are four kinds that you like and unlimited numbers are available. How 
many different selections are possible? 

32. Find the number of distinguishable selections of size n from a set containing one 
member of type A and infinitely many members of types B and C (members of 
the same type are indistinguishable). 

33. Find the number of distinguishable selections of size n from a set containing 
one member Aq and infinitely many members of types A\, A 2 , ..., A^ (again, 
members of the same type are indistinguishable). 

In Exercises 34 to 37, find the number of integer solutions of the equation, given the 

conditions on the variables. 

34. x\ + X 2 + -T 3 + X 4 = 12, x\ > 0, X 2 > 1, X 3 > 0, X 4 > 2. 

35. x\ + X2 + X3 + X4 = 16, x\ > 0, X2 > 3, X3 > — 1, X4 > 3. 
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36. X\ + X 2 + X 3 = 22, X\ > 3, X2 > 4, X 3 > 5. 

37. *1 + *2 + X 3 + X 4 + X 5 = 18, X\ > 1 , X2 > 3, X 3 > 0, X 4 > —2, X 5 > 5. 

5.6 The Binomial Theorem and Its Applications 

The Binomial Theorem 

Suppose s is any positive integer. We would like to find an expression for (x + y) s 
as a function of x and y. 

Consider the following product of a factors: 


(x\ + yi)-(x2 + yi) • • • (x s + y s ). 


If you expand this you will get 2 s terms, and each will have a factors, with either x* 
or yi as the i th factor. For example, if a = 2 you get 


xix 2 + yix 2 + xiy 2 + yiyi, 


and if a = 3 the product is 

x 1 X 2 X 3 + yix 2 x 3 + xiy 2 *3 + TDM + x\x 2 y3 + TD 2 T 3 + xiy 2 ys + TD2T3- 
In constructing the terms, you make a decisions: 

• from the first factor, take either xi or y \; 

• from the second factor, take either X 2 or y 2 ; 

and so on. 

In the general case, how many terms are there with exactly k x’s? The term with 
X 1 X 2 • • • Xkyk+iyk +2 • • • Ts can ar i se i n one an d on ly one wa Y i n the product: you must 
choose the x term at steps 1, 2, ..., k and the y term at every later step. Similarly, 
the term with x/,x/ 2 • • • Xi k and all other terms y’s occurs if and only if you choose 
x at steps i 1 , ( 2 , ..., 4, and y at the other s — k steps. So the number of terms that 
contain precisely k of the x’s will equal the number of ways of selecting k objects 
from {xi, X 2 , ..., x s ], namely (£). 

Now put xi = X2 = • • • = x s = x and y\ = y 2 = • • • = y s = T- There will be 
exactly (£) terms equal to x k y s ~ k . So 



^=0 


This result is the binomial theorem for positive integer index a. 
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Theorem 38. If x and y are any numbers and s is any positive integer, then 



Once you know the formula, it is easy to prove Theorem 38 by induction. 
The first few cases of the theorem should be familiar to you: 


(x + y) 2 — x 2 + 2 xy + y 2 , 

(.X + y) 3 = x 3 + 3 x 2 y + 3 xy 2 + y 3 , 

(x + y) 4 =x 4 + 4x 3 y + 6 x 2 y 2 + 4xy 3 + y 4 . 


They should also be familiar when one of the two variables is replaced by 1, or when 
a negative sign or constant coefficient is included: 


(1 + x) 3 = 1 + 3x + 3x 2 + x 3 , 

(x — y) 3 = x 3 — 3x 2 y + 3xy 2 — y 3 , 
(2x + y) 3 = 8x 3 + 12x 2 y + 6xy 2 + y 3 . 


The later cases are easily worked out from the formula. 


Sample Problem 5.37. Write down an expression for (1 + x) 6 . 


Solution. The relevant coefficients are (q) = (^) = 1, (^) = (^) = 6 , (^) = 
0 = 15, 0=21. So 


(1+x) 6 



= 1 + 6 x + 15x 2 + 21x 3 + 15x 4 + 6 x 5 + x 6 . 


Practice Exercise. Write down an expression for (x — y) 6 . 

The binomial theorem can be used to find the approximate values of powers of 
numbers close to 1 . 

Sample Problem 5.38. Find the approximate value (to within 10 -3 ) of 1.02 10 . 

Solution. We write 1.02 10 = (1+ 2 x 10 -2 ) 10 . From the binomial theorem, this 
equals 


1 + 10-2-1(T 2 + 45-4- 1(T 4 + 120-8- 1(T 6 + • • •. 


It is not hard to check that, from the third term on, every subsequent term is, 
at most, one-tenth of the one before it (see Exercise 17 in this section). So the 
approximate value is 
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1 0.2 T - 0.018 T" 0.00096 • • • — 1.219 approx. 

Practice Exercise. Find the approximate value of 0.99 8 . 

You should observe the relationship between Pascal’s triangle, in Section 5.5, 
and the binomial theorem. The coefficients in row s of the triangle are precisely the 
coefficients in the expression for (x + y) s . For example, row 6 is 

1,6, 15,21, 15,6, 1. 

(Remember, the first row is row 0.) 


Consequences of the Theorem 


Two interesting summation formulas can be obtained by putting x = y = 1 and 
x = —1, y = 1, respectively, in Theorem 38: 



D-d* 

k =o 



= 0 . 


(5.7) 

(5.8) 


Equation (5.8) can be rewritten as 


i + D-d* 

k =l 



= 0 , 


—p-o 

= D- 1 )*“ 1 (t); ( 5 - 9 > 

this form is sometimes more useful. 


Equation (5.7) gives us an interesting way to work out the number of elements in 
the power set of a set. 


Theorem 39. Suppose S is any set with s elements. The number of subsets of S (that 
is, the number of elements ofV(S)) is 2 s . 

Proof. The number of k-element subsets of S is (£). The total number of subsets 
equals the number of zero-element subsets (1, for the empty set), plus the number 
of one-element subsets, plus the number of two-element subsets, and so on up to S 
itself. So the number is Ylk=0 (£)> an d the result follows from (5.7). □ 
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Sample Problem 5.39. How many ways are there to choose three or more people 
from a set of 11 people ? 


Solution. There are 2 11 possible ways to choose a subset of the 11 people. How¬ 
ever, the subsets with 0, 1, or 2 elements are not allowed. So the number is 


2 11 — 





= 2048- 1 - 11 -55 = 1981. 


Practice Exercise. How many ways are there to choose at most seven books 
from a selection of ten books? 


Sample Problem 5.40. Evaluate Ylk =o £(£)• 

Solution. We first evaluate J2k =o &(£)■** anc * ^ en P ut x = 1- Since 


we have 


s\kx k 


kl(s-k)! 

s!x k 


(k-iy.(s -ky. 

(s - \)\x k 
s (k- 1)!0 -k)\ 


5 - 1 

k- 1 


Jfc-1 




Make the substitution h = k — 1. The right-hand side becomes 

5—1 




E 

h =0 


J - 1 

h 




which equals sv(l + x) s 1 . Putting x = l, 


k=0 v 7 k= 1 


: 2 S_1 S. 


(The fact used at the end of the sample problem, that the term (namely sk(fy) is zero 
when k = 0 , is often useful in changing between an expression of the form Y^k =0 
and one of the form J2k =l •) 


Exercises 5.6 

In Exercises 1 to 10, use the binomial theorem to calculate the indicated quantity. 
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1. (x - l) 4 . 

3. (1-2 z) 5 . 

5. (2x + y) 4 . 

7. (x+x” 1 ) 4 . 

9 . (x + ;y + z) 3 . 


2 . (x - 3y) 3 . 

4. (1+x 2 ) 3 . 

6. (2x — 5y) 3 . 

8 . ( 1 +x 2 ) 2 . 

10. (1 + 2 x -y) 3 . 


11. How many 8 -bit strings (strings of eight binary digits) are there? 

12. Use the binomial theorem with x = —l and y = 1 to prove that 


+ 


and consequently that 


+ 


5 ) 

")2 n 


+ • 


+ 


+ 


+ 


+ • 


+ 


-> 5-1 


+ ' 


13. Prove ELo(T')=2 : 

14. Use the answer to the preceding exercise to find the number of distinguishable 
k -sets that can be chosen from a set S of 3k + 1 objects, if k members of S are 
indistinguishable and all other members can be distinguished. 

15. Prove Theorem 38 by induction. 

16. Show that 

n+r —1 


X I ' — 1 / 7 \ 

£ Cl) 


n + 1 
r + 1 


Hence show that, if we consider all possible &-sets chosen from the set {1, 2, 

..., n}, the average value of their smallest elements is 

n + 1 
r + 1 

17. Verify the assertion that, in the binomial expansion of (1 +2 x 10 -2 ) 10 , each term 
from the third onward is at no greater than one-tenth the size of the preceding 


one. 
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5.7 Some Further Counting Results 

Principle of Inclusion and Exclusion 

We are now in a position to prove Theorem 28, the principle of inclusion and exclu¬ 
sion. We recall its statement. 

The number of elements in X\ U X 2 U • • • U X n is 

n n n 

E IV 1 nx (2 | + ... 

6=1 6=16=6+1 

n n n 

+ (-i) t " 1 E E E l I ii n 4n--ni it | 

6=1 6=6+1 6=6—1+1 

+ ••• + (-lf+X;, nx i2 n-nx;j. (5.10) 

Proof. Suppose the object x belongs to r of the sets, namely Xj x , Xj 2 , ..., Xj r . Then 
x is counted r times in calculating Jf\Xi l \, and contributes r to that part of the sum 
in (5.10). It appears Q times in subsets of the form X^ Pi Xi 2 since it appears in 
precisely those for which {//, z' 2 } is a two-set of {j/, 72 , ..., yV}; so it contributes 
— ( 2 ) to that part of (5.10). Continuing, we find its total contribution is 

+ 11 ) 

which equals 1 by (5.9). (The sum in (5.11) stops at k — r because x cannot occur in 
any intersection of r + / or more sets.) So the sum in (5.10) contains a contribution 
1 for every member of X\ U X 2 U • • • U X n , and its total is |Xi U X 2 U • • • U X n |. □ 

It is easiest to think of (5.10) as the 

sum of (the sizes of) the sets, minus sum of the (sizes of) intersections of two sets, 
plus sum of (the sizes of) the intersections of three sets, minus .... 

There are n terms in the first summation, Q) in the second, (") in the third, and so 
on. This is particularly useful when all the sets Xi are the same size, as are all the 
sets Xi x f! Xi 2 , and so on. If the intersection of any k of the sets Xi always has Sk 
elements, then (5.10) becomes 

ns\ - (^j s 2 + -+■ (5.12) 

Sample Problem 5.41. A social security number is a string of nine digits. If any 
digit can be used in any position, how many such numbers are possible? How 
many are there that contain all the odd digits ? 
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Solution. There are 10 9 possibilities. To solve the second part, it is easiest to 
find out how many do not contain all the odd digits, then subtract the answer 
from 10 9 . Write S x for the set of all possibilities that do not contain v. Then we 
want |Si U S 3 U S 5 U S 7 U S 9 I. For any x, |S*| = 9 9 , if x / y, \S X D S y \ = 8 9 . 
Similarly \S X D S y D S z \ = 7 9 , and so on. So, from (5.12) the sum is 



and the answer we required is 


10 9 - 5-9 9 + 10-8 9 - 10-7 9 + 5-6 9 - 5 9 . 


Practice Exercise. How many possible social security numbers contain all the 
digits 6 , 7, 8 , and 9? 

Counting Powers 

There are some interesting applications of the principle of inclusion and exclusion in 
elementary number theory. For example, suppose we want to know how many of the 
numbers from 1 to n are perfect squares, perfect cubes, or any perfect higher power. 
We illustrate the technique for finding this in a relatively small case so that the result 
can be checked by hand, but the method can be applied in general. 

We shall find out how many of the numbers {1,2,..., 100} are perfect squares, 
perfect cubes, or perfect higher powers. The number 1 is a perfect nth power for 
every n. For convenience, we shall find the number of perfect powers other than 1, 
and add 1 to the answer. In this case we need only consider up to sixth powers: 
2 7 = 128 > 100 , so every seventh or higher power (other than 1 ) is greater than 100 . 

Write Xi for the set of all ith powers in the range{2, 3, ..., 100}, for i = 2, 3, ..., 6 . 

Then, from the theorem, the number of integers that belong to at least one of these 
sets is 

\X 2 \ + \X 3 \ + \X 4 \ + \X 5 \ + \X 6 \ 

- (|X 2 n x 3 | + |X 2 n x 4 | + • • • + |x 4 n x 6 | + \x 5 n x 6 \) 

+ (|x 2 n x 3 n x 4 | + |x 2 n x 3 n x 5 | + • • • + |x 4 n x 5 n x 6 |) 

- (\x 2 n x 3 n x 4 n x 5 \ + ■ ■ ■ + |X 3 n x 4 n x 5 n x 6 \) 

+ |x 2 nx 3 nx 4 nx 5 nx 6 |. 

All of these sets and their sizes are easy to calculate. The largest square up to 100 is 
10 2 , so X 2 = {2,3,4, 10 } and |X 2 | = 9, X 3 = {2,3,4} so |X 3 | = 3, |X 4 | = 2 , 

|Xs | = \Xg\ = 1. In general, Xi fl Xj = A/, where l is the least common multiple of 
i and j, and similarly for intersections of three or more, so X2 fl A3 = X2 (T X^ = 
A 3 H A 6 = A 2 H A 3 Pi A 4 = A 2 H A 3 Pi A 6 = Xg, X 2 fl A 4 = A 4 , and all the others 
are empty. Therefore the total is 
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\X 2 \ + l*3l + l*4l + |X 5 | + |X 6 | 

- \x 2 n x 3 \ - \x 2 n x 4 \ - \x 2 n x 6 \ - \x 3 n x 6 \ 

+ \x 2 nx 3 n x 4 \ + \x 2 nx 3 n x 6 \, 

which equals 9 + 3 + 2+1 + 1 — 1—2—1 — 1 + 1 + 1 = 13. So the answer to the 
original question is 14 (adding 1 for the integer 1). 

The same technique can be applied to problems about factors; see the exercises. 


Derangements 


A derangement of an ordered set of objects is a way of rearranging the objects so that 
none appears in its original position. A derangement of (1, 2, ..., n), for example, is 
an arrangement (oq, a 2 , ..., a n ) of the first n integers in which ofj = i never occurs. 
We write D n for the number of possible derangements of n objects; clearly D n does 
not depend on the type of objects in the set. 


Theorem 40. For every positive integer n, 


D n = n\ 


111 

1 _ l! + 2! _ 3! + - h (_1) 


1 

n\ 


Proof. Write S z - for the set of all arrangements (a\,a 2 ,...,«„) of (1,2 , ,n) in 

which at = i. We want to know how many arrangements are not in S z for any i , so 
we want 


n\- |Si US 2 U---US n [ ( (5.13) 

Each arrangement with ot\ = 1 can be derived from an arrangement of (2, 3 , ,n) 

by putting 1 in front of it, and each arrangement of (2, 3, ..., n) gives precisely one 
member of Si so 

|Si| = (n-l)!. 


Each of the sets Si has the same number of elements as Si, so 

n 

53 is< ! = /»!. 

i=1 

Similarly, Si D S 2 H • • • D Sj has (n — j)\ elements. The intersection of any j sets 
has the same size, and there are (") ways of choosing j sets. So 

EE---El 5 ' 1 n^n---n5 ! -.| = (”)(n-y)! = ^. 


The theorem follows upon substituting this formula into (5.10) and applying the 
result to (5.13). □ 
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For consistency we define Do = 1. The other small derangement numbers are 
easily calculated. 

Sample Problem 5.42. Calculate D\, D2, D 3, and D4. 

Solution. Obviously D\ = 0 and D2 = 1. From the formula (or from listing 
cases) we get D 3 = 2 and D 4 = 9. 

If the i th element of the ordered set S occurs in position i of a permutation of S , 
we say i is a fixed point of the permutation. For example, the set Si in the proof of 
Theorem 40 is the set of all permutations of S with fixed point i . 

Suppose the number of permutations of an n set with exactly k fixed points is 
f(n,k). To calculate f(n,k) we first ask in how many ways could one choose the k 
positions to remain fixed? The answer is obviously Q). The other n — k entries must 
form a derangement, so they can be filled in D n -k ways. So the number is {^)D n -k. 

Sample Problem 5.43. How many permutations of {1, 2, 3, 4} have fewer than 
three fixed points? 

Solution. Let P t be the set of permutations with i fixed points. Then the answer 
is 


\Po\ + \Pi\ + \Pi\ = /(4, 0 ) + /(4, 1 ) + /(4, 2 ) 



= 9 + 4-2 + 6-1 =23. 


Practice Exercise. How many permutations of {1,2, 3,4} have two or more 
fixed points? 

A number of theorems can be proved about the derangement numbers. We 
present one example. 

Theorem 41. D n — (n — l)(D n _i + D n _ 2 ). 

Proof. The D n derangements of {1,2, ..., n} can be partitioned into n — 1 classes 
S\, S — 2, , S n - 1 , where Sk consists of all the derangements with last element k. 

(No derangement will have last element n.) We shall prove that each set Sk has 
D n -\ + D n -2 elements. 

The elements of Sk can be classified further. Let Ak be the subset of Sk consisting 
of all the derangements in Sk with kth element n and Bk be the set of those whose 
kth element is not n. 


160 5 The Theory of Counting 


Of the members of A&, if entries n and k are deleted, the remaining elements 
must be a derangement of the ( n — 2)-set T = {1, 2,..., k — 1, k + 1,..., n — 1}, 
and different arrangements give rise to different members of So the number of 
elements of A^ equals the number of derangements of T, or D n - 2 . The members of 
Bk are just the derangements of {1, 2 , , k — 1, n, k+1 , ..., n — 1}, so \ Bk\ = Az-l- 

So |Sjfc| = \A k \ + \B k \ = D n —2 + D n - 1 , and D„ = \S k \, giving the result. 

□ 


The Pigeonhole Principle 

If more than n pigeons are roosting in n pigeonholes, then at least one pigeonhole 
contains more than one pigeon. This idea applies to sets and other mathematical 
objects, and is called the pigeonhole principle. 

Theorem 42. (i) If an s-set is partitioned into n disjoint subsets , where s > n, then 
at least one of the subsets contains more than one element. 

(ii) Suppose a\, 02 ,..., a n are real numbers such that YTi =1 a i — Then for at 

least one value ofi, i = 1 , 2 , ... ,n, at geb/n. 

Proof, (i) Suppose the statement is false. Let 

S = S\U S 2 U • • - u Sn, (5.14) 

be a partition of S , where |Sj = s > n. Si D Sj = 0 whenever i ± j, Si / 0, and 
|5j| < 1, for i = 1,2 ,... ,n. But then \Si\ = 1 for i = 1, 2,..., n, and by (5.14) 
s — 15*1 = || 4“ 1 5 * 2 1 + • • • + |5i| = 1 ~I - 1 ~I - * * * —|— 1 = n. Since s > n , this is a 
contradiction, (ii) Again suppose the statement is false, so that we have at <b/n for 
every i, i = 1, 2,..., n. Then 

n 

Yai < b/n + b/n + • • • + b/n = b. 
i =1 

But Jf!i =1 a i > ^ an( i again we have a contradiction. □ 

Most applications of the pigeonhole principle involve the appropriate partitioning 
of a set into subsets. Sometimes we must also use some special feature of the problem 
and sometimes we need repeated applications of the pigeonhole principle. 

Sample Problem 5.44. Suppose A = {a\, 02 , ..., < 27 } is a set of seven different 
positive integers none of which exceeds 24. Show that the sums of the nonempty 
subsets of A cannot all be distinct. 

Solution. Since A is a seven-set, it has (j) + Q + ( 3 ) + ( 4 ) = 98 nonempty sub¬ 
sets with, at most, four elements. The lower bound on the sums of the nonempty 
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subsets is obviously 1; the upper bound on the sums of the k- sets, for k < 4, is 
21 + 22 + 23 + 24 = 90. Since we have 98 sums that can take at most 90 values, 
at least two sums must be equal. (Notice that we did not need to look at all the 
subsets, only those of size 4 or smaller.) 

Practice Exercise. Show that in a set of eight people there are at least two whose 
birthdays fall on the same day of the week this year. 

Sample Problem 5.45. Suppose 70 students are studying 11 different subjects, 
and any subject is studied by at most 15 students. Show that there are at least 
three subjects that are studied by at least five students each. 

Solution. Since 70 students take 11 subjects, there must be at least one subject 
with at least seven students; call it subject A. At most 15 students take A, so at 
least 55 students study the remaining 10 subjects. Hence at least one of these 
subjects has at least six students; call it subject B. At most 15 students take B , 
so at least 40 students study the remaining nine subjects. Hence at least one of 
these subjects has at least five students; call it subject C. Then A, B, C are the 
required subjects. 

Sample Problem 5.46. A student solves at least one problem every day. He 
makes it a rule never to solve more than 730 problems per year. Show that he 
solves exactly n problems in some set of consecutive whole days for any given 
positive integer n. 

Solution. Consider first the case where n — 1; this gives us a clue to the solution. 
Suppose the student solves exactly two problems per day, throughout any year 
with 365 days. So he solves 730 problems in the whole year and on no one day 
has he solved exactly one problem. But during a leap year, solving exactly two 
problems per day leads him to exceed his quota: there are at least two days of a 
leap year on each of which he must solve exactly one problem. Since leap years 
include all those with numbers divisible by 4 but not by 25 (as well as those with 
numbers divisible by 400), any 8 consecutive years include at least one leap year 
and contain at least (8 x 365) + 1 = 2921 days. We start from some fixed date 
and let S\ be the number of problems solved in the first i days from the starting 
date. Consider the sets of integers 

S = {Si : 1 < i < N = 2921 n), 


and 


T = {n +Si : l <i <N = 2921 n}. 


Notice that Si Sj when i j since the student solves at least one problem per 
day. Similarly n + Si ^ n 4- Sj when i j. Now 


162 5 The Theory of Counting 


\S\ + \T\ = 2 N = 5842 n; 

all the elements of S and T are positive integers, the largest of which is n + S , 
and 


n + S n < n + ( 8 n x 730) = 5841ft. 

Hence there exists at least one positive integer j belonging to both S and T. So 
for some h and k , 

y = 5/j = n + S k , 

which means that on the days k + l,k + 2, ..., h, the student solved exactly n 
problems. 


Exercises 5.7 

1. How many elements are there in the union of the four sets X\X 3 , X 4 if each 
of the sets has precisely 28 elements, each intersection Xi D Xj has precisely nine 
elements, each triple of sets has exactly one element in common, and there is no 
element common to all four sets? 

2. Set Xi has 20 elements, for i = 1, 2, 3, 4. X\ and X 4 are disjoint; every other 
intersection of two of the sets has ten elements. | 2 fi H X 2 H X 3 1 =5, \X2 H X^ H 
X 4 | = 3. What is the size of IiUI 2 UI 3 U X A 1 

3. Find the number of perfect powers of integers (other than first powers) 

(i) between 1 and 200 (inclusive), 

(ii) between 1 and 500 (inclusive). 

4. How many integers in the range from 1 to 250 are divisible by at least one of 2, 
3, and 5? How many are divisible by at least one of 2, 3, 5, and 7? 

5. How many positive integers less than 60 are relatively prime to 60? 

6. Find the number of (positive) prime numbers less than 100. 

7. Calculate D 5 and D§. 

8 . Calculate D 7 and Dg. 

9. Seven job applicants are to be interviewed by seven members of a hiring com¬ 
mittee. Each will be interviewed twice. To avoid bias, each candidate is assigned 
their first interviewer at random; the second interviewer is also chosen at ran¬ 
dom, but no candidate is to get the same interviewer twice. 

(i) How many different ways are there to schedule the first round of interviews? 

(ii) How many different ways are there to schedule the whole process? 

10. Prove that, if ft > 2, D n — nD n -\ = Show that Theorem 40 can be 

proved from this result. 
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11. Prove that D n+ \ is even if and only if D n is odd. 

12. Our mailman is not very good at his job. When he delivers letters to the six 
apartments in our building, he delivers them at random. Today there is exactly 
one letter addressed to each apartment, and he delivers exactly one to each apart¬ 
ment. How many possible ways are there for him to deliver the letters? In how 
many of these does he get the following results: 

(i) none right? (ii) at least one right? 

(iii) exactly five right? (iv) all of them right? 

13. At a party, n people are seated in a room with n + 1 chairs. They go to another 
room for supper. When they return and sit down again, it is found that no one 
occupies the same chair as before. Show that this can occur in precisely D n + 
D n +1 ways. 

14. A = {ai, < 22 , ..., a$} is a set of five distinct positive integers, none of which ex¬ 
ceeds 9. Show that the sums of the nonempty subsets of A cannot all be distinct. 

15. How many times must we throw two dice in order to be sure that we get the same 
total score at least six times? 

16. Suppose n objects are distributed between b boxes in such a way that no box is 
empty. How many objects must we choose to be sure that we have selected all 
the contents of at least one box? 

17. In September, October, and November, a student solves at least one problem 
per day but not more than 13 problems per week. Show that she solves exactly 
12 problems on some set of consecutive whole days. 


6 


Probability 


We use the word “chance” frequently in our everyday conversation. For example, we 
meet somebody “by chance,” we “chance upon” the solution to a problem, one team 
has a “better chance” of reaching the Super Bowl than another, the Weather Channel 
announces a “30% chance” of precipitation. 

There are two ideas here. In every case there is the feature of unpredictability— 
a chance occurrence is one for which we cannot be certain of the outcome. The other 
feature is that sometimes chance is quantitative —either it can be measured exactly 
(the weather forecast) or else we can at least say one “chance” is greater than another 
(the football teams). We shall refer to these two aspects of chance as randomness and 
probability , respectively. 

For example, say you flip a coin. There is no way to tell whether it will fall heads 
or tails, so we would say this is a random occurrence. If the coin is made uniformly, 
so that it is equally likely to show heads or a tails, we normally call it a fair coin, and 
we say the probability of a heads is and so is the probability of a tails. (Sometimes 
we say “50%” instead of “^.”) 

Suppose the coin in our example was not uniform; for example, say it was made 
as a sandwich of disks of metal, like a quarter, but the different disks were of different 
densities. It might be that, if we flipped it enough times, heads would come up 70% 
of the time and tails only 30%. Then we would say the probability of a heads is 
However, this still qualifies as random because no one knows beforehand whether a 
particular flip will be one of the (more common) ones that results in a heads or one 
that gives a tails. The probabilities do not have to be equal in order for an event to be 
random. 

To discuss randomness formally, we shall use the idea of an “event” just as we 
did in Section 5.1. The ideas and notation of set theory—and in particular the number 
of elements—will be very useful. 


W.D. Wallis, A Beginner's Guide to Discrete Mathematics , 

DOI 10.1007/978-0-8176-8286-6_6, © Springer Science+Business Media, LLC 2012 
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6.1 Probability Measures 


Probability Distributions 

Consider a random event: for example, the fall of a heads when a fair coin is flipped. 
We do not know whether a heads will show on any particular flip, but we expect to 
see a heads half the time when a large number of attempts are made. We use the 
word “probability” for this numerical measure of the likelihood of an event. The 
probabilities 1 and 0 indicate absolute certainty and impossibility, respectively. 

We shall write P(E) for the probability that the event E occurs. For example, if 
H means “a heads shows” and T means “a tails shows” when a fair coin is flipped, 
then 

P{H) = P(T)= 1 -. 

A list of the probabilities of all outcomes of an experiment is called the proba¬ 
bility distribution of the experiment. 

One important case is where each of the possible outcomes is equally likely, as 
in the case of the fair coin. In this case we say the experiment has equally likely 
outcomes , and set their probabilities equal. In this case the probability distribution 
is called uniform , and we also refer to the experiment as a uniform experiment. If a 
uniform experiment has a sample space with n elements, then we shall assign prob¬ 
ability ^ to each outcome. 

If a uniform experiment has sample space S , then an event E will occur if the 
outcome is one of the \E\ outcomes in E. We expect that E will occur in fraction 
\E\/\S\ of cases, if the experiment is repeated. So we define 



Sample Problem 6.1. A fair die is rolled. Si is the event that the number i is 
rolled for i = 1, 2, 3, 4, 5, 6. E means the roll of an odd number, and F the roll 
of a number less than 3. Find the probabilities of these events. 

Solution. Since the die is fair, the outcomes are equally likely: 


P(Si) = P(S 2 ) = P(S 3 ) = P(S 4 ) = P(S 5 ) = P(S 6 ) = 

6 

E and F contain three and two outcomes, respectively, so P(E) = | = ^ and 
p W = 1 = 3- 


Practice Exercise. A single die is rolled. What are the probabilities of the fol¬ 
lowing events? 
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E : A 4 or a 5 is rolled. 

F: An even number is rolled. 

G: An odd number greater than 2 is rolled. 


If E is any event in a uniform experiment, then P(E) will be a proper fraction. If 
E and E are two disjoint events in such an experiment, then the event E U E (“either 
E or F occurs”) has \E\ + I/ 7 ! elements, so 


P(E U F) 


\E\ + \F\ 

ISI 


\E\ \F\ 

— + — = P(E) + P(F). 
|5| \S\ 


We shall now apply these ideas to the general situation to make a general definition 
of probability. 

Consider an experiment with sample space S = {s\, S 2 , ..., s m }. A probability 
distribution for the experiment is a function P with the following properties: 


1. For each Si, 1 < i < n, Pfa ) is a real number and 0 < P(st) < 1; 

2. P(s\) + P(sf) + • • • + P(s m ) = 1. 


Sample Problem 6.2. A black die and a white die are thrown simultaneously. 
What is the probability that the sum of the numbers shown is 8 given that the dice 
are fair? 


Solution. Write (jc, y) to mean that x shows on the black die and y shows on the 
white die. Then there are 36 possible outcomes, namely (1,1),(1,2),...,(1,6), 
(2, 1),..., (6, 6); they are equally likely. Five of them—(6, 2), (5, 3), (4, 4), 
(3, 5), and (2, 6)—have a sum of 8. So 


P(E) = 


\_E\_ 

ISI 


5 

36’ 


Practice Exercise. A quarter and a nickel are flipped simultaneously. What is 
the probability that exactly one heads shows, assuming that both coins are fair? 


Sample Problem 6.3. A deck of cards is shuffled and one card is dealt. What is 
the probability that it is a spade? 


Solution. There are 52 cards in a deck of which 13 are spades. So 


P(E) = 


\_E\_ 

|S| 


13 _ 1 
52 “ 4' 


Practice Exercise. A deck is shuffled and one card dealt face up. What is the 
probability that it is a picture card (king, queen, or jack)? 
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El = {11} 

Ei\ = 1 

P(Ei) = 4 

£3 = {12, 21} 

£31 = 2 

P(E 3 ) = 4 = yg 

E 4 = {14,22,31} 

£ 4 | = 3 

P(E 4 ) = 4 = 4 

E 5 = {14,23,32,41} 

II 

in 

fc] 

P(E 5) = 4 = 5 

E 6 = {15,24, 33,42,51} 

£ 6 | = 5 

P(E 6) = 4 

E 1 = {16,25,34, 43,52,61} 

II 

t" 

P ( E l) = 4 = 5 

F 8 = {26, 35, 44, 53, 62} 

£gl = 5 

P(E$) = f 6 

E 9 = {36, 45, 54, 63} 

Eg 1=4 

'O 

II 

II 

E m = {46, 55, 64} 

O 

II 

p (E 10) = 4 = 4 

E\\ = {56, 65} 

£nl = 2 

^n) = 4 = 4 

E\2 = {66} 

£l2l = 1 

p (En) = 4 


Table 6.1. Probabilities when rolling two fair dice 


Nonuniform Experiments 

Not all experiments are uniform. But in many cases, given an experiment A, we can 
find a uniform experiment B such that the outcomes of A are events (not necessarily 
simple) of B. 

For example, consider an experiment in which two fair dice are rolled, and the 
total of the points on them is recorded. This experiment has 11 possible outcomes 
S2, S 3 ,..., si2; Si means the total is i. To calculate the probability of 57, we shall look 
at a slightly different experiment. In this one, two fair dice are rolled and the result 
is recorded as an ordered pair of digits—(1, 3) means “1 on die 1, 3 on die 2.” (For 
brevity, one commonly writes “13” instead of “(1, 3)”.) There are 6 2 = 36 outcomes, 
and each has probability ^. In this experiment we write Ei for the event that the two 
numbers showing add to i . Then 


P(E) = 


\Ei\_ 

36 ’ 


and we can calculate all the probabilities; see Table 6.1. But in each case the proba¬ 
bility of the outcome 57 in the first experiment is obviously equal to the probability 
of Ei in the second experiment. So P{sf) = 3 ^, P(s$) = and so on. 


Sample Problem 6.4. Three coins are flipped and the number of heads is re¬ 
corded. What are the possible outcomes and what is the probability distribution? 


Solution. The outcomes are the four numbers 0, 1,2, and 3. To calculate proba¬ 
bilities, suppose three coins were flipped and all results recorded. Ei is the event 
“there are i heads showing.” Then 15*1 = 8 and 
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E 0 = {TTT}, 

P(£o) = 

Ex = {HTT, THT, TTH}, 

P(£l) 4 

E 2 = {HHT , HTH , THH }, 

p(e 2 ) = l 

E 3 = {HHH}, 

PiE 3) = i 

Practice Exercise. A game is played using two dice each of which has the num- 


bers 1, 2, 3 on its faces (so each number appears twice per die). An experiment 
consists of rolling the dice and adding the numbers showing. What is the sample 
space? What is the probability distribution? 

Sample Problem 6.5. There are five marbles—two blue, three red—in a box. 
One is selected at random. What is the probability that it is blue ? 


Solution. If the marbles were marked v,w,x,y,z, where v and w are blue and 
the others are red, then the event “blue is chosen” is {v, w} and its probability is 


P(E) = 


\_E\_ 

|S| 


2 

5' 


Practice Exercise. There are two red, three white, and four blue marbles in a 
box. One is drawn at random. What is the probability that it is not blue? 


Nonuniform Probabilities 

Even when the outcomes are not equally likely, we calculate the probability of an 
event from the formula 

p(E) = j2 p (v. 

seE 

Several important facts can be deduced from this; in particular, if E is any event, 
0 < P(E) < 1; P(E) = 0 if and only if E is impossible, and P(E) = 1 if and only 
if E is certain. If E and F cannot both happen—they are mutually exclusive events , 
as defined in Section 5.1—then 

P(E U F) = P(E) + P(F), 


and for general E and F, 

P(E U F) = P(E) + P(F) - P(E H F). 
The probability of the complement E of E is 


P(E) = 1 - P(E). 
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Sample Problem 6 . 6 . E and F are events in a sample space with P(E ) = 0.6, 
P(F) = 0.4, and P(EHF) = 0.2. What is P(E U F)? What is P(E)? 

Solution. 


P(E U F) = P(E ) + P(F) - P(E H F) 

= 0.6+ 0.4+ 0.2 

= 0 . 8 , 

P(E) = 1 - P(E) 

= 1 - 0.6 
= 0.4. 

Practice Exercise. E and F are events in a sample space with P(E) = 0.7, 
P(F) = 0.3, and P(E H F) = 0.1. What are P(E U F), P(E)7 

These probabilities can be represented in a Venn diagram—in each area of the 
diagram write the probability of the corresponding event. Then the answer can be 
obtained by addition. The Venn diagram for the preceding Sample Problem is 



Sample Problem 6.7. A discrete mathematics class is restricted to business and 
math majors, all freshmen or sophomores. The percentage makeup of the class is 

Freshman business majors 30%, 

Freshman math majors 35%, 

Sophomore business majors 21%, 

Sophomore math majors 14%. 

A name is chosen at random from the class list. What are the probabilities of the 
following events ? 

(i) The student is a freshman. 

(ii) The student is not a freshman. 

(iii) The student is either a freshman or a math major. 

Solution. We write F for the event that the student is a freshman, S for sopho¬ 
more, B for business major, and M for math major. Then the data mean 


P(F(1B) = 0.3, 
P(SHB) = 0.21, 


P(F n M) = 0.35, 
P(SHM) = 0.14. 
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(i) Since M = B in this problem (all students are math or business majors and 
none can be both or else the total would add to more than 100%), the usual 
equation 

F = (FnB)U(FnB), 

becomes 

F = (F n B) U (F n M), 
and since this is a disjoint union, 

P(F) = P(F FB) + P(F H M) 

= 0.3 + 0.35 
= 0.65. 

(ii) p(F) = 1 - P(F) = 1 - 0.65 = 0.35. 

(iii) FUM = FU(Sn M), and this is a disjoint union, so 

P(F U Af) = P(F ) + P(S H M) 

= 0.65 + 0.14 
= 0.79. 


The data can also be represented as 



B 

M 

F 

0.30 

0.35 

S 

0.21 

0.14 


The questions can then be answered by adding the probability in the appropriate 
cells of the diagram. For example, 

P(F U M) = 0.30 + 0.15 + 0.14 = 0.79. 

Practice Exercise. An entomologist has found that the butterflies in a certain 
area can be classified as follows: 

striped males 19%, 

striped females 22%, 

unstriped males 28%, 

unstriped females 31%. 

Represent the data in a diagram. Assuming the butterflies appear at random, what 
is the probability that the next butterfly to be sighted will be (i) striped, (ii) fe¬ 
male, (iii) either striped or female? 
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Exercises 6.1 

1. A fair coin is flipped four times. Find the probabilities that 

(i) at least three heads appear, 

(ii) an even number of heads appear, 

(iii) the first result is a heads. 

In Exercises 2 to 1 two fair dice are rolled. What is the probability of the indicated 
event? 

2. The total is 9. 

3. The total is 6. 

4. The total is even. 

5. The total is odd. 

6. Both scores are even. 

7. One odd and one even number are shown. 

8. A box contains 12 cards, one for each month of the year. A card is drawn at 
random. 

(i) What is the probability that the selected card is March? 

(ii) What is the probability that the selected card is from a month with an r in its 
name? 

9. A box contains six red and four white balls. One ball is drawn at random. What 
is the probability that it is white? 

10. A box contains seven red and five white balls. One ball is drawn at random. What 
is the probability that it is white? 

11. In the game of roulette, a wheel is divided into 38 equal parts, labeled with the 
numbers from 1 to 36, 0 and 00. The spin of the wheel causes the ball to be 
randomly placed in one of the parts; the chances of the ball landing on any part 
are equal. Half of the parts numbered from 1 to 36 are red and half are black; the 
0 and 00 are green. If the chances of the ball landing in any one part are equal, 
what are the probabilities of the following events? 

(i) The ball lands on a red number. 

(ii) The ball lands on a black number. 

(iii) The ball lands on a green number. 

(iv) The ball lands on 17. 

(v) The ball lands on a number from 25 to 36 inclusive. 

12. In a lottery there are 90 losing tickets and 10 winning tickets. You draw one 
ticket at random. What is the probability that it is a winner? 
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13. The moose population in a Canadian park are 45% plain brown, 35% mottled, 
and 20% spotted. One moose is captured at random for the Bronx Zoo. What are 
the probabilities that 

(i) it is spotted? 

(ii) it is not spotted? 

(iii) it is not mottled? 

14. A random number from 0 to 99 is chosen by a computer. What are the probabil¬ 
ities of the following events? 

(i) The number is even. 

(ii) The number ends in 5. 

(iii) The number is divisible by 11. 

15. Box A contains tickets numbered 1, 2, 3, 4. Box B contains tickets numbered 2, 
3, 4, 5. One ticket is selected from each box. 

(i) List all the elements of the sample space of this experiment. 

(ii) Find the probability that the tickets have the same number. 

(iii) Find the probability that the sum of the two selected numbers is even. 

16. A box contains one red, two blue, and two white marbles. One marble is selected 
at random. What are the probabilities that it is 

(i) red? 

(ii) blue? 

(iii) white? 

17. A box contains four red, two blue, and three white marbles. One is selected at 
random. What is the probability that it is 

(i) blue? 

(ii) not blue? 

18. There are 20 students in a class. There are 12 men (eight physics and four chem¬ 
istry majors) and eight women (five physics and three chemistry majors). One 
student’s name is selected at random. What is the probability that 

(i) the student is a chemistry major? 

(ii) the student is male? 

19. A sample space contains two events, E and F, and 

P(E) = 0.70, P(F)= 0.25, P(EnF) = 0.15. 

Determine 

P(E), P(EUF), P(EUF). 

20. A sample space contains two events, E and F , and 
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P(E) = 0.5, P(F) = 0.3, P(EHF) = 0.2. 


Determine 

P(F), P(EUF), P(EHF). 

21. The events E, F, and G satisfy 

P(E) = 0.6, P(F) = 0.6, P(G) = 0.8, 

/>(£ UP) = 0.8, P(£ H G) = 0.5, P(F H G) = 0.5. 

Determine 

P(EHF), P(EUG), P(FUG). 

22. The events E, F, and G satisfy 

P(E) = 0.6, P(F) = 0.5, P(G) = 0.5, 

R(£ UP) = 0.8, P(E U G) = 0.8, P(F fl G) = 0.2. 

Determine 

P(PflF), P(£HG), P(FUG). 

23. An examination has two questions. Of 100 students, 75 do Question 1 correctly 
and 72 do Question 2 correctly. Sixty-four do both questions correctly. 

(i) Represent the data in a Venn diagram. 

(ii) A student’s answer book is chosen at random. What is the probability that 

(a) Question 1 contains an error? 

(b) exactly one question contains an error? 

(c) at least one question contains an error? 

24. Of 1000 researchers at Microsoft, 375 have a degree in mathematics and 450 
have a degree in computer science. Of the researchers, 150 have degrees in both 
fields. One researcher’s name is selected at random. 

(i) What is the probability that the researcher has a degree in mathematics, but 
not in computer science? 

(ii) What is the probability that the researcher has no degree in either mathemat¬ 
ics or computer science? 

25. In a market survey, 50% of those polled said that they usually buy medications 
at a pharmacy and the others that they buy them at a supermarket. 80% buy meat 
at the supermarket, 20% at the butcher’s. 40% buy both meat and medications 
at the supermarket. One shopper is selected at random from the survey group. 
What is the probability that he 

(i) buys meat at the supermarket but buys medications at the pharmacy? 

(ii) does not buy either meat or medications at the supermarket? 
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26. During the 1992 presidential elections, 200 voters were surveyed in the St. Louis 
area. Voters were classified according to whether they lived in Missouri (MO), 
Illinois (IL), or other states (OS), and whether they intended to vote for Bush 
(REP), Clinton (DEM), or Perot (IND). The results were as follows: 



REP 

DEM 

IND 

MO 

44 

33 

21 

IL 

20 

22 

14 

OS 

14 

20 

12 


One of the voter’s responses is selected at random. What are the probabilities of 
the following events? 

(i) The voter is an Illinois democrat. 

(ii) The voter is from Missouri. 

(iii) The voter intends to vote for Perot. 

(iv) The voter is from outside Illinois. 

(v) The voter is either a Democrat or is from Missouri. 

27. Two hundred new automobile buyers were surveyed. Their purchases were as 
follows: 



Sedan 

Pickup 

Van 

Ford 

50 

10 

10 

GM 

35 

10 

15 

Other 

40 

10 

20 


One survey form is chosen at random. What is the probability that the vehicle 
was 

(i) a sedan? 

(ii) a Ford? 

(iii) a General Motors van or pickup? 

28. Five hundred researchers at the National Security Agency were surveyed. It was 
found that 300 have at least one degree in computer science, 173 have a Bach¬ 
elor’s degree in computer science, 123 have a Master’s degree in computer sci¬ 
ence, and 99 have a Ph.D. in computer science. Additionally, it was found that 63 
have both a Bachelor’s and Master’s degree in computer science, 22 have both a 
Master’s and Ph.D. degree in computer science, and 19 have both Bachelor’s and 
Ph.D. degrees in computer science. One employee is selected at random. What 
are the probabilities of the following events? 

(i) She has all three degrees in computer science. 

(ii) She has the Bachelor’s and Ph.D. degrees, but not the Master’s. 
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29. Last year, there were 200 students enrolled in Calculus II. Of these students. 140 
also enrolled in Linear Algebra. Their majors were distributed as follows: 


Enrolment 

Major 

Math 

Science 

Business 

Calculus only 

20 

35 

5 

Calculus and Linear Algebra 

90 

45 

5 


Suppose the same trend is followed this year. When the next student comes to 
enroll in Calculus II, what is the probability that this student 

(i) is a science major? 

(ii) is not enrolling in Linear Algebra this year? 

(iii) is not a mathematics major, but will take Linear Algebra? 

30. One hundred fifty Business students tried to sign up at registration for required 
Marketing, Accounting, and Finance courses. Seventy registered for Marketing, 
60 for Accounting, and 50 for Finance. Twenty-five registered for both Mar¬ 
keting and Accounting, 15 for Finance and Marketing, 15 for Finance and Ac¬ 
counting, and 5 signed up for all three courses. One student’s registration form 
is chosen at random. What is the probability that the student 

(i) was unable to sign up for any of the courses? 

(ii) signed up for exactly one of the courses? 

(iii) signed up for Marketing and Accounting, but not for Finance? 

31. Jerry’s Gas sells three grades of gasoline at both their full and self-service 
pumps. One day they kept track of their first 100 customers and their purchases 
in the following table: 



Regular 

Regular 

Premium 


Leaded 

Unleaded 

Unleaded 

Full service 

15 

15 

20 

Self service 

20 

25 

5 


If this trend continues, what is the probability that the next customer who comes 
in 

(i) goes to the self-service pumps? 

(ii) buys regular unleaded at the self-service pumps? 

(iii) buys premium unleaded at the full-service pumps? 
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6.2 Repeated Experiments 

Stochastic Processes 

Sometimes an experiment can be viewed as a sequence of smaller experiments. The 
outcome consists of a sequence: “outcome of subexperiment 1” followed by “out¬ 
come of subexperiment 2,” and so on. An experiment of this kind is called a stochas¬ 
tic process. 

For example, suppose ajar contains three red and two blue marbles. An exper¬ 
iment consists of drawing a marble from the jar, noting its color, drawing another 
marble, and noting the second marble’s color. (The first marble is not replaced.) The 
possible outcomes are the four ordered pairs of colors: RR, RB, BR , and BB. 

To analyze the experiment, we first observe that the first marble drawn is either 
red (three-fifths of the cases) or blue (two-fifths). If the first marble is red, then the 
remaining marbles are two red and two blue, so in half of these cases (^ x | ^ 

of the original cases) the second marble drawn is red and in the other half (^ of the 
original) it is blue. So, in the obvious notation, 

3 

P(RR) = —, 

10 

3 

P(RB) = —. 

10 

In the same way, if the first marble is blue, then the second marble drawn is red in | 
and blue in ^ of the cases. So 

2 3 3 

P(BR) = - x - = —, 

7 5 4 10 

2 1 1 

P(BB) = - x - = —. 

5 4 10 

So the four outcomes are RR, RB, BR, and BB, and 

P(RR) = P(RB) = P(BR ) = A P (bB) = 

Sample Problem 6.8. There are two bags A and B. Bag A contains two red balls 
and one green ball; bag B contains two red and three green balls. First a bag is 
chosen at random, then a ball is chosen at random from it. What are the possible 
outcomes, and what are their probabilities? What is the probability that a red 
ball is selected? 

Solution. The outcomes ar z AR, AG, BR, and BG where A and B are the bags 
and R and G denote the color of the ball selected. The possible outcomes of 
the first subexperiment—the selection of bag—are A and B, with probabilities 
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each ^. If A is selected, the probability of R is | and the probability of G is ^. If 
B is selected, 7? has probability | and G has probability |. So 


1 2 
P(AR) = - x - 

m G) = ^ x 1 

1 2 
P(BR ) = - x - 
2 5 

1 3 

P(£G) = - x - 
v ' 2 5 

So the probability of red is 


1 

3’ 

1 

6’ 

1 

5’ 

3 

To’ 


P(AR) + P(BR) = ^ + ^ = 


8 

F5 


Notice that there are four red and four green balls. If the two bags were emptied 
and one ball chosen, the probability of red would be The process of dividing 
the balls into two bags, whose contents are not identical, changes the probabili¬ 
ties. 


Practice Exercise. A die is chosen from a pair of dice and rolled. One die 
(die A) is standard; the other (die B) has three faces marked 1 and three marked 6. 
What are the outcomes? Assuming that the die is chosen at random, what is the 
probability of a 6? Of a 3? 


Tree Diagrams 

The use of tree diagrams to represent experiments (see Section 5.1) is particularly 
useful for stochastic processes. Let us go back to the example of three red and two 
blue marbles. The first stage of the experiment can be represented as a branching into 
two parts, labeled “red” and “blue,” and the branches can be also be labeled with the 
probabilities | (on red) and | (on blue): 

R 


B 

In each case the second stage can be represented similarly. The tree diagram for 
the whole experiment is shown in Figure 6.1. 

Finally, the outcomes are sequences of the results of the subexperiments. Each 
endpoint on the right of the tree represents an outcome, and the sequence can be read 
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R 


B 


R 


B 


Fig. 6.1. Tree diagram for the whole experiment 
R P(RR) 



3 1 
5 X 2 


B P(RB) 

R P(BR) 


3 1 

" 5 x 2 

2 3 

= 5 X 4 


10 


3 

10 


2 1 
" 5 x 4 


)B P(BB) 

Fig. 6.2. Tree diagram with branch probabilities 


10 


10 


by tracing back through the branch to the root. The probability of an outcome can 
be calculated by multiplying the probabilities along the branch. So the experiment 
under discussion yields the diagram Figure 6.2. 


Bernoulli Trials and Binomial Experiments 

Consider the experiment “flip a coin three times and count how many heads occur.” 
This can be viewed as repeating one basic experiment that has exactly two outcomes 
(“flip a coin”) three times and then counting the results. 

We shall define a Bernoulli trial to be an experiment in which there are exactly 
two possible outcomes. In many applications it makes sense to think of one of these 
events as “success” and the other as “failure” (abbreviated to S and F). We shall 
often denote the probability of success by p. A binomial experiment is one in which 
a Bernoulli trial is repeated a certain number of times, and the outcome is the number 
of successes in total. So our initial example was a binomial experiment in which the 
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trial is repeated three times. Since the number of heads is to be counted, we would 
probably refer to a heads as a “success.” If the coin was fair, p will equal \. 

In a binomial experiment, if the trial is repeated n times, there are n + 1 possible 
outcomes, from “0 successes” to “n successes.” 

Sample Problem 6.9. A trial consists of throwing a die; a result of 1 or 2 is a 
success , other throws are failures. What is the probability of two successes in 
three trials? 


Solution. There are three ways in which two successes can occur: the sequences 
SSF, SFS , and FSS. Now 


1 1 2 

P(SSF) = - x - x - 

3 3 3 

1 2 1 

P(SFS) = - x - x - 

3 3 3 

2 1 1 

P(FSS) = - x - x - 

So the probability of two successes is 


2 

27’ 

2 

27’ 

2 

27' 


2 2 2 _ 6 
27 + 27 + 27 “ 27' 


Practice Exercise. In the experiment described in this sample problem, what is 
the probability of exactly one success in three trials? 


More generally, the number of sequences of n trials that contain k successes and 
n — k failures is (^). The probability of any such sequence is p k ( 1 — p ) n ~ k . So the 
probability of k successes is 


CM 1 -* 


Sample Problem 6.10. A sales representative estimates that a sale results from 
one in four of his calls to companies. If he makes five calls today , what is the 
probability of at least two sales? 


Solution. Each call is a Bernoulli trial with p = so the five calls in a day can 

be thought of as a binomial experiment with p = n = 5. So 


P (2 successes) = 



270 

1024’ 

90 

l024’ 


P (3 successes) = 
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P( 4 successes) = 


P( 5 successes) = 



- I = 5 x — = 
4 5 

1 

- = 1 x — = 


15 

1024’ 

1 


4 5 1024’ 


and the probability of at least two successes is the sum of these: 


270 + 90+ 15 + 1 _ 376 _ 47 
1024 “ K)24 ” 128’ 

or about 37%. Alternatively, we could have noticed that “he makes at least two 
sales per day” is the complement of “he makes 0 or 1 sales per day” The proba¬ 
bility of 0 or 1 sales is 

/5\ /1 \°/3\ 5 /5\ /1 \ 1 /3\ 4 

P( 0 sales) + P(1 sale) = ( )[-)(-)+( )(-)(-) 

\0/V 4 / V 4 / VVV 4 / V 4 / 

3 5 3 4 

= 1 x — + 5 x — 

45 45 

_ 243 + 5 x 81 _ 243 + 405 

“ 1024 “ 1024 

648 _ 81 

“ I024 ““ E28 ’ 

so the probability of at least two sales is 

81 _ 47 
~~ 128 “ 128 


Practice Exercise. If the salesman could improve his record to give him a 50% 
chance of a sale from each call, what is his probability of at least three successes 
in the day? 

Note that, for practical purposes, the exact value ^ in the preceding sample 
problem is not important. It would normally be enough to say “his chance of two or 
more sales is a little better than one in three.” In probability problems, it is important 
to have the correct formula, and to have an approximate idea of the numerical value. 
If, for any reason, a precise value is needed, it can be calculated from the formula. 


Sample Problem 6.11. The salesman in the preceding sample problem would 
like to have at least a 60% chance of two successes per day. How many calls 
should he make per day? 


Solution. Suppose he makes n calls. The probability of 0 or 1 success is 
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So what he wants is 


3" -1 (3 + n) __ 4 

4" < To' 


We do some experimental arithmetic: 

^-in 4- 

n = 5 : 
n = 6 : 


n = 7 


4" 

= = 0. 
1024 

3" _1 (3 + n) 

243 x 9 
— — ( 

4" 

4096 

3"- 1 (3 + n) 

729 x 10 

4 n 

16384 

3 n ~ l (3 + n) 

2187 x 11 


4 ft 

So at least eight calls are needed. 


65536 


= 0.44... 
= 0.36. 


Practice Exercise. How many calls are needed to give the salesman at least a 
70% chance of two successes per day? 


The Birthday Coincidence 


Sometimes probabilities can be very surprising. One well-known example is the 
“birthday coincidence” problem. Suppose there are 30 people in a room. What is 
the probability that two of them share the same birthday (day and month)? 

For simplicity let us ignore leap years. (The answer will still be approximately 
correct.) What is the probability of the complementary event—that no two have the 
same birthday? (The probability of a birthday coincidence will be found by sub¬ 
tracting this probability from 1.) If we assume the people are ordered in some way 
(alphabetical order, for example), then there are 365 choices for the first person’s 
birthday, 364 for the second, 363 for the third, and so on. So the total number of 
possible lists of birthdays (with no repeats) is 


P( 365, 30) = 365 x 364 x ... x 336. 


If there is no restriction, each person has 365 possible birthdays. So there are 365 30 
possibilities. Therefore the probability of the event “no two have the same birthday” 
is 


P(E ) = — 

|S| 


P (365, 30) 
365 30 


which is approximately 0.294. So the probability of the “birthday coincidence” is 
about 0.706. 


Similar calculations can, of course, be carried out for any number of people. 
Table 6.2 shows the probability of a “birthday coincidence” for various numbers of 
people. It is about a 50% chance when there are 23 people in the room. 
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n 

Pn 

5 

0. 027 

10 

0. 117 

15 

0. 253 

20 

0.411 

23 

0. 507 

25 

0. 569 

30 

0. 706 

40 

0. 891 

50 

0. 970 


Table 6.2. P n is the probability of a “birthday coincidence” among n people 


Most students find this very surprising, and would guess that many more people 
would be needed to bring the probability up to 50%. 

Exercises 6.2 

1. Ajar contains six marbles, four red and two white. In stage 1 of an experiment, 
one marble is selected at random and its color noted; it is not replaced. In stage 2, 
another marble is drawn and its color noted. 

(i) Draw a tree diagram for this experiment. 

(ii) What are the probabilities of the four outcomes ( RR , RW , WR , WW)1 

(iii) What is the probability that the two marbles selected are different colors? 

2. A cage contains six mice: three white females, one white male, and two gray 
males. In an experiment, two mice are selected one after the other, without re¬ 
placement, and sex and color are noted. 

(i) Draw a tree diagram for this experiment. 

(ii) Find the probabilities of all the outcomes. 

(iii) What is the probability that two males are selected? 

(iv) What is the probability that both a white and a gray mouse are selected? 

(v) What is the probability that the second mouse is gray? 

3. Three cards are dealt in order from a standard deck. In each case it is only 
recorded whether the card is a face card (ace, king, queen, jack) or a minor 
card (10 through 2). 

(i) Draw a tree diagram for this experiment. Find the probabilities of the differ¬ 
ent outcomes. 

(ii) What is the probability that at least two face cards are dealt? 

4. Three cards are dealt in order from a standard deck. Only the color (red or black) 
of the card is recorded. The cards are not replaced. 
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(i) Draw a tree diagram for this experiment. 

(ii) What is the probability that the three cards are red? 

5. Eight evenly matched horses run a race. Three are bay colts, one is a bay filly, 
two are brown colts, and two are brown fillies. The color and sex of the first and 
second finishers are recorded. 

(i) Draw a tree diagram for this experiment. 

(ii) What is the probability that both the horses recorded are colts? 

(iii) What is the probability that the two horses are of different colors? 

(iv) What is the probability that the second horse is a filly? 

6. A jar contains four red, three white, and two blue marbles. One marble is drawn 
and placed aside; then another marble is drawn. Only the colors are recorded. 
Draw a tree diagram for this experiment. What is the probability that the two 
marbles are of different colors? 

7. A coin is weighted so that a heads is twice as likely to occur as a tails. It is tossed 
four times, and the result is noted. 

(i) What is the probability that exactly one heads occurs? 

(ii) What is the probability that at least three heads occur? 

8. A die is rolled six times. What is the probability of 

(i) exactly one six? 

(ii) at most one six? 

9. A fair coin is tossed six times. Which is more probable—that heads and tails 
occur three times each, or that they divide 4 and 2 (either 4 heads or 4 tails)? 

10. A multiple choice test contains five questions, and each question has three pos¬ 
sible answers. A passing grade is three or more correct answers. What is the 
probability of passing if you guess the answers at random? 

11. In a 20-question, true-false test, what is the probability of getting exactly 16 
answers correct by random guessing? 

12. A pair of fair dice are rolled and the total is recorded. If this is done ten times, 
what is the probability that 7 is recorded at least three times? 

13. On average, 1 light bulb in 20 is defective. What is the probability that there is 
more than one defective bulb in a box of ten? 

14. A drug is effective in nine cases out of ten. If it is tested on 12 patients, what is 
the probability that it will be ineffective in more than two cases? 

15. A baseball pitcher estimates that he can throw his fast ball for a strike seven 
times in every eight. 

(i) What is the probability that he throws exactly seven strikes in eight pitches? 
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(ii) What is his probability of throwing at least seven strikes in eight pitches? 

16. A children’s game uses a die that has 1 on three of its faces, 2 on two of its faces, 
and a 3 on the last face. If it is rolled six times, what is the probability of rolling 
2 exactly three times? 

17. The probability that a new tire will last 35,000 miles is 0.9. If you replace all 
four tires now, what is the probability that you will need to replace at least one 
within the next 35,000 miles? 

18. There are five people in a room. What is the probability that two were born in 
the same month (but not necessarily the same year)? 


6.3 Counting and Probability 

All of the elementary counting principles that we studied in Chapter 5 are useful in 
calculating probabilities. We shall show this using several examples. 


Choosing Marbles 


Many elementary problems involve random drawing of marbles from ajar (or bag 
or box). These obviously arise in lotteries and other games of chance, but this is 
not the only reason for studying such problems. The mathematics is the same if we 
consider many other types of random selection. As an example, suppose there are 500 
Democrat supporters and 400 Republicans in your area. Working out the probabilities 
of various combinations being contacted in an opinion poll involves exactly the same 
calculations as the ones made in studying random drawings from a jar of 500 black 
and 400 white marbles. There are also examples in physical science—for example, 
molecules escaping from a container of heated gas behave in the same way. 


Sample Problem 6.12. There are 12 red marbles and 3 blue marbles in a jar. 
Three are selected at random. What is the probability that all are red? 


Solution. There are 15 marbles, so the number of ways of selecting three is ( 3 5 ). 
So | S\ — (^). The number of ways of selecting three red balls is \E\ = ( 3 2 ). So 


P(E) = 


(j 2 ) _ 12! 12!3! 

(15) “ 9!3! X TIT 

12-11-10 _ 44 
15-14-13 ~~ 91' 


Practice Exercise. An urn contains five red, four blue, and two green marbles. 
Three marbles are chosen at random. What is the probability that the three are all 
different colors? 




186 6 Probability 


Sample Problem 6.13. A box contains four red, three white, and two blue balls. 
Two balls are chosen at random. What is the probability that they are the same 
color? 

Solution. There are Q selections available. There are ( 2 ) = 6 ways to choose 
two red balls, ( 2 ) = 3 ways to choose two white balls, and ( 2 ) = 1 ways to 
choose two blue balls. So 

12 + 6+1 19 

P (two are the same) =-_-= —. 

(2) 36 

Practice Exercise. A jar contains six white and four blue balls. Two balls are 
chosen at random. What is the probability of selecting at least one white ball? 


Card Problems 


Remember that in a standard deck there are 52 cards, 13 in each of the four suits. 
So there are four cards in each of the 13 denominations: Ace, King, ..., 4, 3, 2. In 
most card games, the hands of cards are dealt face down to the players, so the order 
in which cards are received does not matter. 


Sample Problem 6.14. A poker hand of five cards is dealt from a standard deck. 
What are the probabilities of the following hands? 

(i) The four kings and the ace of spades. 

(ii) A full house (three cards of one denomination, two of another). 

(iii) A hand with no pair (all different denominations). 


( 52 \ 

5 j possible selections of 

five cards from the full deck of 52. So, in each case, \S\ = ( 5 5 2 ). 

(i) The event E \ “the hand consists of four kings and the ace of spades” contains 
only one outcome. So 


\E\ = l, 

\E\ 1 

p<£, = w = (!) 

5-4-3-2- 


1 


52-51 -50-49-48 


1 

2598960’ 


(ii) Let’s say a full house is of “type (x, y)” if it contains three jc’s and two y’s— 
one could have “type (7,5),” “type (king, 4),” and so on. There are 13 ways 
to choose the denomination of the three, and for each such choice there are 
12 ways to select the two. So there are 13 x 12 = 156 types. Once the type 
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is known, we can calculate the number of hands of that type. For example, if 
there are three kings and two fours, then there are ( 3 ) ways of choosing which 
kings are to be used (we must choose three out of the four possible kings), 
and ( 2 ) ways of choosing which fours. So there are ( 3 ) x (^) = 4 x 6 = 24 
hands of each type. So the number of full houses is 

13 x 12 x x 
= 13 x 12 x 24, 

and the probability is 

13-12-24 _ 6 

( 5 s 2 ) “ 4165' 



(iii) If there is no pair, the hands have one card of each of five denominations. 
This collection of denominations can be chosen in (* 5 3 ) ways. The card in 
each denomination can be selected in four ways. So the number of hands is 



and the probability is 

O' 45 _ 2112 


which is a little greater than 50%. 

Practice Exercise. A poker hand of five cards is dealt from a standard deck. 
What is the probability that it contains a flush (all five cards of the same suit)? 


Choosing a Committee 


Sample Problem 6.15. A committee of four people is to be chosen at random 
from a club with 12 members, four men and eight women. 

(i) What is the probability that at least one man is chosen? 

(ii) What is the probability that one particular member, Jack Smith, is chosen ? 


Solution. The number of possible outcomes—all possible committees—is ( 4 2 ) . 
This is | S |. They are equally likely. 

(i) Let E be the event that at least one man is chosen. Then E is the event that 
no man is chosen. \ E\ = (4). So \E\ = ( 4 2 ) — ( 4 ) and 


P(E) = 1 - 


0 . 

(?) 


14 

99 


= 1 -= 1 - 


85 

99’ 
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(ii) The number of committees containing a given individual is (g 1 ). (Jack Smith 
must be a member; the remaining three are chose from the other 11 members 
and that can be done in (g 1 ) ways.) Therefore, 

(V) 1 

P(Jack Smith is chosen) = 

o 3 

Practice Exercise. A class has 18 members—six math majors, five in eco¬ 
nomics, and seven in computer science. A class committee of three is chosen 
at random. What is the probability that all three have the same major? 


Derangement Problems 


The number of derangements of n objects is 


D n = n\ 


1 1 1 
l! + 2! ~~ 3! + 


+ (-!)"■ 
n\ 


The number of ways to arrange n objects is n!, so the probability that a randomly 
selected arrangement is a derangement is 


Dn 

n\ 


1 1 1 
II + 2! ~~ 3! + 


+ (-D" A 

n\ 


Sample Problem 6.16. Four students have identical backpacks. As they leave the 
library, each chooses a backpack at random. What is the chance that none gets 
the correct backpack? 

Solution. The probability is 

D 4 _ 111 1 _ 3 

IT “ - I + 2 - 6 + 24~8' 

Practice Exercise. What is the probability that some, but not all, get their cor¬ 
rect backpacks? 


Exercises 6.3 

1. There are three red, three blue, and four white marbles in ajar. Two marbles are 
chosen at random. What are the probabilities that 

(i) both are blue? 

(ii) the two are different colors? 

2. There are ten red, three white, and four blue marbles in ajar. Two are chosen at 
random. What are the probabilities that 




(i) neither are red? 

(ii) exactly one is red? 

(iii) both are red? 
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3. From the jar in the preceding exercise, three marbles are chosen. What are the 
probabilities that 

(i) the three are red? 

(ii) the three are of different colors? 

4. A jar contains four red, five blue, and three white marbles. Three marbles are 
chosen at random. What are the probabilities that 

(i) all three are red? 

(ii) none are red? 

5. A student must select two courses from a list of two humanities and three sci¬ 
ence courses. He selects at random. What is the probability that both are science 
courses? 

6. A box of 50 matches contains three broken matches. Two matches are drawn at 
random. What is the probability that neither is broken? 

7. Your midterm test contains ten questions with true-false answers. If you select 
your answers at random, what are the probabilities of getting 

(i) exactly four correct answers? 

(ii) exactly two correct answers? 

(iii) at most two correct answers? 

8. A committee of six people (four men and two women) need to select a chairman 
and secretary. They do so by drawing names from a hat. What are the probabili¬ 
ties that 

(i) both are men? 

(ii) exactly one is a man? 

9. In the game Yahtzee, five fair dice are rolled at once. What are the probabilities 
of getting 

(i) a Yahtzee, that is, all five dice the same? 

(ii) four of a kind (that is four of one denomination, with the other different)? 

(iii) a full house (that is three of one denomination, and two of another)? 

10. An electrician knows that two switches are faulty out of a batch of five. He tests 
them one at a time. What is the probability that he finds the two faulty switches 
in the first two trials? 
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11. A five-card poker hand is dealt at random from a standard deck. What is the 
probability that it contains a straight (A2345, 23456, 34567, ..., 10JKQA)? (In 
a straight, suits do not matter.) 

12. Suppose license plates consist of two letters followed by four numbers. Suppose 
all possible combinations have been used. If you select a car at random, what is 
the probability that the plate has 

(i) both letters the same? 

(ii) no repeated symbol (letter or number)? 

(iii) an even number? 

13. A congressional committee contains five Democrats and four Republicans. To 
choose a subcommittee they list all groups of four that contain at least one Re¬ 
publican and one Democrat, and then select one at random. What is the proba¬ 
bility that the selected subcommittee consists of two Democrats and two Repub¬ 
licans? 

14. A secretary types three letters and the three corresponding envelopes. She is in a 
hurry to quit work for the day, and forgets to check that she puts the correct letters 
in the correct envelopes, so it is as though she puts the letters in the envelopes at 
random. What are the probabilities that 

(i) every envelope gets the correct letter? 

(ii) no envelope gets the correct letter? 

15. A committee of six men and four women select a steering committee of three 
people. If all combinations are equally likely, what is the probability that all 
three will be of the same gender? 

16. Four people apply for two jobs. Among them are a husband and wife. Assume 
that the four are equally well qualified and choices are made at random. 

(i) What is the probability that both husband and wife are appointed? 

(ii) What is the probability that at least one is appointed? 

17. In a lottery, you must select three different numbers from 1, 2, 3, 4, 5, 6, 7, 8, 9. 
If your three are the numbers drawn, you win first prize; if you have two right 
out of three, you win second prize. (The order of the numbers does not count.) 

(i) What is the probability of winning first prize? 

(ii) What is the probability of winning second prize? 

18. A club has 12 members, including Mr. and Mrs. Smith. A committee of three is 
to be chosen at random. 

(i) What is the probability that both are chosen? 

(ii) What is the probability that neither are chosen? 
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19. Your discussion group chooses its chairman at each monthly meeting by drawing 
names from a hat. If there are 12 members, what is the probability that you will 
not be chosen at any meeting this year? 

20. A pinball machine selects a number from 0 to 9 at random; you are awarded a 
free game if that number equals the last digit in your score. 

(i) What is the probability that you win a game in this way on your first at¬ 
tempt? 

(ii) What is the probability that you do not win a game on your first attempt, 
but you win a game on your second attempt? 

(iii) What is the probability that you will get no “match” in five games? 
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Conditional Probability Defined 


Suppose you draw a marble at random from a jar containing two red and two blue 
marbles. You look at this marble, then discard it, and draw another marble. What is 
the probability of drawing a blue marble the second time? There are two answers. If 
the first marble is red, then the probability is |; if it is blue, the probability is ^. We 
write 


P (second blue | first red) = -, 
P (second blue | first blue) = -. 


The sign “|” is read as the word “given,” so the first probability above is “probabil¬ 
ity that the second is blue given that the first is red ” These probabilities are called 
conditional because they express the probability of a result in the second draw if a 
certain condition (the result of the first draw) is satisfied. 


In general, suppose the fact that event A occurs gives extra information about 
the probability that B occurs. The probability that B occurs, given that A occurs , 
which is the conditional probability P(B | A), is the best measure to use in deciding 
whether or not B will happen. 


Sample Problem 6.17. Two cards are dealt from a standard deck. The first card 
is not replaced before the second is dealt. What is the probability that the second 
card is a king, given that the first card is an ace? What is the probability that the 
second card is a king, given that the first card is a king ? 


Solution. Given that the first card is an ace, the second card is a random selection 
from 51 equally likely possibilities. Four of these outcomes are kings. So 
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P{AC) = P(A) P(C I A) 

P(AD ) = P(A ) P(D I A) 

P(BC ) = P(B) P(C IB ) 

P(BD) = P(B) P(D I B) 


Fig. 6.3. Tree diagram with branch probabilities 


P(K | A) = 


n(E) 

n(S) 


4 

5T 


If the first card is a king, there are only three kings among the cards for the second 
trial, so 


P(K | K) = —. 


Practice Exercise. Two cards are dealt, without replacement, from a standard 
deck. If the first is a spade, what is the probability that the second is (i) a spade? 
(ii) a heart? 


The probabilities written on the later branches of a tree diagram are all condi¬ 
tional probabilities. If an experiment consists of two stages in which the first stage 
has possible outcomes A and B and the second stage has possible outcomes C and 
D , the tree diagram is shown in Figure 6.3. Of course, this all works in the same way 
when there are more than two stages, or when there are more than two outcomes at a 
stage. 

Sample Problem 6.18. Two cards are dealt from a standard deck without re¬ 
placement, and it is noted whether or not the cards are kings. What are the out¬ 
comes of this experiment, and their probabilities? Represent the information in a 
tree diagram. 

Solution. We shall write K and N for “a king” and “a card other than a king.” 
By a similar argument to Sample Problem 6.17 we see that the probability of a 
king on the second draw, given a king on the first draw, is 
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K P(K,K) = 


13 51 


N 


* 


* 


Fig. 6.4. Tree diagram for Sample Problem 6.18 


3 

663 


48 

663 


48 

663 


564 

663 


and similarly 


P(K | A0 = —. 


Given that the first card is a king, the second card is either a king or not, so 
P(K | K) + P(N \ K) = 1; 


therefore, 

and similarly, 


P(N | K) = 


48 

5l’ 


47 

P(7V | AO = —. 


So the diagram is the one shown in Figure 6.4 (where, for example, “K, N v 
means “king first, non-king second”). 


Practice Exercise. Repeat this sample Problem, in the case where the informa¬ 
tion noted is whether or not the card is a spade. 


The Conditional Probability Formula 

The probability of the sequence “A followed by B ” is 

P(AB) = P(A)P(B\A). (6.1) 

This formula can be used more generally, even when A does not happen before B. 
Whenever we get more information about whether an event B occurs by knowing 
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whether or not event A occurs, we define the conditional probability of B given A 
by 


P(B\A) = 


P(A n B) 
P(A ) 


( 6 . 2 ) 


Sample Problem 6.19. A car pool contains both ten red and ten white Fords and 
15 red and five white Buicks. A car is chosen at random, by picking up the keys. 

1. What is the probability that a red car is chosen? 

2. You notice that the keys belong to a Buick. What is the probability that a red 
car is chosen? 


Solution. 1. There are 40 cars, of which 25 are red. So 


\R\ 25 

P(R) = — = — 
v 7 \S\ 40 


5 

8 ’ 


2. There are 20 Buicks of which 15 are red. So 


P(B) 
P(RHB) 
P(R | B) 


20 _ 1 

40 “ 2’ 
15 _ 3 

40 “ 8’ 
P(RDB) 
P(B ) 



3 

4’ 


Practice Exercise. There are five Republicans and three Democrats on a com¬ 
mittee. A subcommittee of two is chosen by a random drawing. 

1. What is the probability that both are Democrats? 

2. You are told that the committee contains at least one Democrat. What is the 
probability that both are Democrats? 


Sample Problem 6.20. A carpenter uses screws made by two companies, 
X and Y; 40% of his stock comes from X and the rest from Y. About 2% of the 
screws from Y are faulty. If he chooses a screw at random, what is the probability 
that it is a faulty screw from company Y? 


Solution. We have 


So 


P(Y) = 0.6, P(F | Y) = 0.02. 


P(Y n F) = P(Y)P(F I Y) 

= 0.6 x 0.02 = 0.012, 
and the probability is 0.012 or 1.2%. 
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Practice Exercise. Records show that two March days out of five are dull, and 
it rains on one out of every three dull days. What is the probability that March 
12th next year will be dull and rainy? 


Independence 


We say two events are independent in everyday English if they have no effect on each 
other. In probability theory, this idea is formalized as follows: Events A and B are 
independent when 

P(A | B) = P(A). 

Provided neither A nor B is impossible, the relation 


P(B | A) = 


P(AHB) 
P(A ) 


leads us to the equivalent definition: A and B are independent when 


P(AHB) = P(A) x P(B). 


Sample Problem 6.21. Two dice are rolled and their sum is recorded. Consider 
the following events. 

A: Sum is 2, 8, or 11. 

B: Sum is even. 

C: Sum is 4, 7, or 10. 

Which of these events are independent? 


Solution. We write Ei for the outcome “the sum is i .” These outcomes are the 
events shown with their probabilities in Table 6.1. Then 

A = {E2, Eg, En}, 

B = {E2, E4, Eft, Eg, E10, E12}, 

C = {E 4 , E 7 , E10}, 

A (T B = {E2, Eg}, 

Anc = 0, 

b n c = {E4, E10}, 

and from the table, 

P(A) = (1 + 5 + 2)/36 = 8/36, 

P(B) = (1 + 3 + 5 + 5 + 34- l)/36 = 18/36, 

P(C) = (3 + 6 + 3)/36 = 12/36, 
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P(A n B) = = 6/36, 

36 

p(Anc) = o, 

P(B fl C) = (3 + 3)/36 = 6/36. 

Now 

P(A) x P(B ) = 8/36 x 18/36 = 4/36 / P(A n 5), 

P(A) x P(C) = 8/36 x 12/36 = 8/108 / P(A n C), 

P(fi) x P(C) = 18/36 x 12/36 = 6/36 = P(B n C), 
so B and C are independent, but A and B are dependent, and so are A and C. 

Practice Exercise. An urn contains eight balls numbered 1 through 8. Balls 1, 
2, and 3 are red; 4, 5, 6, and 7 are white; 8 is blue. One ball is drawn. Consider 
the following events 

A : The ball is red. 

B : The ball is blue. 

C: The number is odd. 

Which of these events are independent? 


A Summation Formula 


Suppose two events, A and B are considered. Since 

A = (AnB)u(AnB), 

and the latter events are mutually exclusive, we have 

P(A) = P(AHB) + P(AHB), 


so 

P(A) = P(A | B)P(B) + P(A | B)P(B). 


More generally, if the possible outcomes of an experiment are B \, B 2 , ..., then 

k 

P{A) = Y J P{A\B i )P{B i ). (6.3) 

i =1 


In terms of tree diagrams, this formula is a way of stating in symbols the fol¬ 
lowing rule: “to find the probability of A, find the probabilities associated with all 
branches that contain A, and sum them.” 
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Sample Problem 6.22. There are six boxes, two of them round and four square. 
Each round box contains two green marbles and three blue marbles. Each square 
box contains one green marble and three blue marbles. A box is chosen at random 
and a marble is chosen at random from it. What is the probability that the marble 
is blue? What is the probability that the box was round, given that the marble 
was blue ? Represent the probabilities in a tree diagram. 

Solution. We write R, S, G, B for round, square, green, blue. Then the proba¬ 
bilities of choosing round and square boxes are 

P(R) = 1/3, P(S) = 2/3. 


So 


P(B) = P(B | R)P(R) + P(B | S)P(S) 
= (3/5) (1/3) + (3/4) (2/3) 

= 1/5 + 1/2 = 7/10. 

We can calculate P(R D B)\ 


So 


P(R HB) = P(B 

P(R I B) = 


| R)P(R) = (3/5)(1/3) = 1/5. 

P(RHB) 1/5 _ 2 
P(B) ~ 7/10 " 7' 


The diagram is 

B P(RDB)= 1 

g />(/? n G) = 1 x 1 = -1 

B P(S n B) = | 1=1 

g P(S n G) = | 1 = 1 

P(B) = P(R n B) + P(S n B) = 1/5 + 1/2 = 7/10. 

(This last equation is the sum of the probabilities of all branches that contain B.) 



Practice Exercise. Consider the builder in Sample Problem 6.20. Suppose fur¬ 
ther that about 1.5% of the screws from company X are faulty. If he selects a 
screw at random, what is the probability that it is faulty? 
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Exercises 6.4 

1. Box A contains three red pens and four blue pens. Box B contains two red, one 
green, and one blue pen. A pen is selected from Box A at random, and placed 
in Box B. Then a pen is selected at random from Box B. What is the probability 
that this pen is 

(i) red? (ii) blue? (iii) green? 

2. A city worker can either go to work by car or by bus. If she goes by car, she uses 
the tunnel 40% of the time and the bridge 60% of the time. If she takes the bus, 
it is equally likely that her bus will use the tunnel or the bridge. 

(i) Draw a tree diagram to represent her possible routes to work. 

(ii) Suppose she drives on three days out of five and takes the bus twice out of 
every five times. What are the probabilities of the various outcomes? What 
is the probability, on a given day, that she crosses the bridge on her way to 
work? 

3. Box 1 contains five red balls and three white balls. Box 2 contains two red and 
two white balls. An experiment consists of selecting two balls at random from 
Box 1 and placing them in Box 2, then selecting one ball from Box 2 at random. 

(i) Draw a tree diagram for this experiment. 

(ii) What is the probability that the ball chosen from Box 2 is red? 

4. A card is dealt at random from a regular deck. 

(i) What is the probability that it is a jack given that it is a picture card (king, 
queen, or jack)? 

(ii) What is the probability that it is a 5, given that it is not a picture card? 

5. Two fair dice are rolled. Consider the following events. 

A: The sum is 6. 

B : Both dice show even numbers. 

C: At least one die shows a 4. 

What are 

P(A | B ), P(A | C), P(B | A), P(B | C), P(C | A), P(C \ B)1 

6. A pair of fair dice are rolled. Consider the following events. 

D\ The sum is 7. 

E : The sum is odd. 

F : At least one die shows a 4. 

What are 

P(D | £), P(E | D), P(D | F), P(F \ D)1 
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7. Two cards are selected from a regular deck, without replacement. Consider the 
following events. 

A: Both cards are red. 

B : The second card is red. 

C: At least one card is red. 

D\ At least one card is a heart. 

What are 

P(A), P(D), P(A | B ), P(A | C), P(D | B), P(D | C)? 

8. Two fair dice are rolled. Consider the following events. 

A: The sum is 8. 

B: Both dice show even numbers. 

C: At least one die shows a 5. 

What are 

P(A | B), P(A | C), P(B | A), P(B | C), P(C | A), P(C | B)1 

9. Ajar contains three white and three red marbles. Two are drawn in succession, 
without replacement, and the colors are noted. 

(i) Draw a tree diagram for this experiment. 

(ii) Let A, B, and C denote the following events. 

A: Both marbles are red. 

B : The second marble is red. 

C: At least one marble is red. 

What are P(A), P(A \ B ), P(A \ C), P(B \ C)? 

10. In Exercise 6.2.2, what are the conditional probabilities of the following events? 

(i) Two males are selected, given that at least one male is selected. 

(ii) Two females are selected, given that at least one white mouse is selected. 

11. In Exercise 1, what are the conditional probabilities of the following events? 

(i) The pen selected from Box B is blue, given that the pen selected from Box A 
was blue. 

(ii) The pen selected from Box B is blue, given that the pen selected from Box A 
was red. 

12. In Exercise 6.4.2, what are the conditional probabilities of the following events? 

(i) The worker traveled over the bridge on her way to work, given that she 
traveled by bus. 
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(ii) She used the tunnel, given that she came by car. 

13. In Exercise 6.4.3, what are the probabilities of the following events? 

(i) The ball chosen from Box 2 is red, given that both balls chosen from Box 1 
were red. 

(ii) The ball chosen from Box 2 is red, given that at least one ball chosen from 
Box 1 was red. 

(iii) The ball chosen from Box 2 is red, given that at most one ball chosen from 
Box 1 was red. 

In Exercises 14 to 21, illustrate the data with a Venn diagram and calculate P(A \ B) 


and 

'P(B | 

A). 



14. 

P(A) 

= 0.7, 

P(B) 

= 0.6, P(A U B) = 0.9. 

15. 

P(A) 

= 0.7, 

P(B) 

= 0.4, P(A\B) = 0.4. 

16. 

P(A) 

= 0.6, 

P(B) 

= 0.4, P{A C\B) = 0.1. 

17. 

P(AUB) = 

0.7, P(A HB) = 0.3, P(B) = 0.3. 

18. 

P(A ) 

= 0.6, 

P(B) 

= 0.3, P(AHB) = 0.2. 

19. 

P(A U B) = 

0.8, P(A HB) = 0.4, P(A) = 0.6. 

20. 

P(A) 

= 0.4, 

P{B) 

= 0.6, P(A U B) = 0.9. 

21. 

P(A) 

= 0.6, 

P(B) 

= 0.6,and P(A\B) = 0.4. 


In Exercises 22 to 27, A and B are independent events. Find P(AUB) and P(AP\B) 
when the probabilities are as given. 

22. P(A) = 0.6, P(B) = 0.4. 23. P(A) = 0.5, P(B) = 0.8. 

24. P(A) = 0.8, P(B) = 0.8. 25. P(A) = 0.7, P(B) = 0.5. 

26. P(A) = 0.4, P(B) = 0.4. 27. P(A) = 0.9, P(B) = 0.7. 

28. Two cards are selected without replacement from a regular deck. It is recorded 

whether or not each card is a spade. Define the events A and B as follows. 

A : The first card is a spade. 

B : The second card is a spade. 

(i) Draw a tree diagram for this experiment. 

(ii) Find P(A), P(B), P(A \ B ), P(B | A), and P(A D B). 

(iii) Are A and B independent? 

29. Repeat Problem 28 in the case where the first card is replaced and the deck is 
shuffled before the second selection. 

30. A box contains six red and four blue marbles. Two marbles are drawn from 
the box (without replacement) and their colors are noted. Define the following 
events. 


E : The first marble is red. 

F\ The second marble is red. 
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(i) Draw a tree diagram for this experiment. 

(ii) Find P(E), P(F ), P(E \ F), P(F \ E), and P(E (T F). 

(iii) Are E and E independent? 

31. Repeat problem 30 in the case where the first marble is replaced before the sec¬ 
ond drawing. 

32. A weather forecasting station predicts that on December 18 it will snow with a 
probability of 0.6, there will be a change in the wind direction with a probability 
of 0.8, and there will be both snow and a change in the wind direction with a 
probability of 0.4. If the prediction is correct: 

(i) What is the probability that there will be neither snow nor a change in the 
wind direction? 

(ii) What is the probability of snow given that the wind direction has changed? 

33. A random sample of 100 people were asked their income level and whether they 
regularly invest in the stock market. The following table gives the results of the 
survey 


Income level 

Regularly invest in 
stock market 

Yes 

No 

High 

18 

6 

Medium 

21 

16 

Low 

15 

24 


A person in the survey is chosen at random. Let H be the event that the person 
has a high income. Let Y be the event that the person regularly invests in the 
stock market, and let N be the event that the person does not regularly invest in 
the stock market. Find 

(i) P(H), (iii) P(H U 7), 

(ii) P(Y), (iv) P(N | H). 

34. Professor Jones has taught the same course for the last 20 years. Each time he 
teaches it he gives a pretest on the first day of the class. At the end of each 
semester he compares course grades with whether or not students have passed 
the pretest. Seventy percent of the course grades he gives are C or better. He 
finds that 80% of those students who got C or better in the course have passed 
the pretest while 30% of the students who got less than a C in the course also 
passed the pretest. 

(i) Draw and label a tree diagram illustrating the process. 

(ii) Find the probability that a student who takes the pretest will not pass it. 
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(iii) If a student has not passed the pretest, what is his probability of getting less 
than a C in the course? 

35. Two dice are rolled and the sum of the top faces is recorded. Let E be the event 
that the sum is an odd number, let F be the event that the sum is 4, 7, or 10, and 
let G be the event that the sum is 2, 7, or 11. 

(i) Find P(G | E). 

(ii) Determine whether E and F are independent. 

36. Suppose an urn contains two red, four green, and five blue marbles. A marble is 
selected from the urn and its color noted. If it is red or green it is withdrawn, oth¬ 
erwise it is replaced. Then a second marble is selected from the urn. Determine 
the probability the first marble was red, given the second was red. 

37. Two fair dice, one red and one green, are rolled. Events A, B, and C are defined 
as follows. 

A: The sum of the two dice is 7. 

B : The green die shows a 3. 

C : The green die shows a 1. 

(i) Are A and B independent? 

(ii) Are A and C independent? 

38. Three jars, labeled A, B , and C, contain red and green jelly beans as follows: 
A has two red and two green, B has four red and three green, and C has two 
red and five green. A jar is selected at random, and one jelly bean is selected 
at random from it. Represent this experiment in a tree diagram, and find the 
probability that the jelly bean is red. 

39. Suppose E, F, and G are any three events. Prove that 

P(E n F H G) = P(E) x P(F | G) x P(G \ E n E). 

40. Forty percent of those who take drugs also have an alcohol problem and 5% of 
those who do not take drugs have an alcohol problem. If 32% of the population 
take drugs, what is the probability that a person who has an alcohol problem also 
takes drugs? 

41. Two fair dice, one red and one blue, are rolled. Events A, B, and C are defined 
as follows. 

A: The sum of the two numbers shown is 6. 

B : The number showing on the blue die is 2. 

C: The blue die shows a 3. 

(i) Are A and B independent? 

(ii) Are A and C independent? 
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42. Two jars, labeled A and B , contain marbles as follows: A has two red, one white, 
and four blue, and B contains six red, two white, and two blue. Ajar is selected 
at random and a marble is selected at random from it. Represent this experiment 
in a tree diagram. What is the probability that the marble chosen is blue? 

In Exercises 43 to 50, E and E are events in the sample space S. You may assume that 
0 < P(E) < 1 and 0 < P(F ) < 1. Is the statement true, false, or is it impossible to 
decide ? 


43. 


45. 


47. 

49. 


P(E 

1 E) = 

1. 

44. 

P(E 

\E) = l. 

P(F 

1 S) = 

1. 

46. 

P(S 

1 F) = 1. 

P(F 

1 S) = 

P(F). 

00 

P(E 

| E n F) = 0. 

P(E 

1 F) = 

P(F I E). 

50. 

P(E 

U F)> P(EDF) 


6.5 Bayes’ Formula and Applications 

Bayes’ Formula 

If A and B are any two events, then A Pi B and B Pi A are the same event. However, 
the conditional probability formula gives two different looking expressions for their 
probabilities: 


P(AHB) = P(B | A)P(A), 
P(B n A) = P(A I B)P(B). 

It follows that the two expressions must be equal: 

P(B | A)P(A) = P(A | B)P(B). 


This formula is usually written in the form 


P(B | A) = 


P(A | B)P(B) 
P(A) 


(6.4) 


Sample Problem 6.23. A store gets 40% of its stock of light bulbs from factory X, 
35% from Y, and 25% from Z. Some bulbs are faulty: 1% of the output from X, 
2% from Y, and 3% from Z. A light bulb is chosen at random from stock and 
found to be faulty. What is the chance that it comes from factory Z? 


Solution. Write Z for the event “the bulb comes from factory Z” and A for “the 
bulb is faulty.” Then P(Z \ A) is required. From the data we know P(Z) = 0.25 
and P(A \ Z) = 0.03. To calculate P{A) we use the formula in (6.3): 
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P(A) = P(A | X)P(X) + P(A | Y)P(Y) 

+ P(A | Z)P(Z) 

= (0.01)(0.40) + (0.02)(0.35) + (0.03)(0.25) 
= 0.004 + 0.007 + 0.0075 
= 0.0185. 


So 


P(Z | A) = 


(0.25)(0.03) 

0.0185 


which is between 40% and 41%. 


75 _ 15 
185 “ 37’ 


Practice Exercise. You are given three coins; one is biased so that it shows 
heads two-thirds of the time, and the other two are fair (heads and tails are equally 
likely). You select a coin at random and flip; it shows heads. What is the proba¬ 
bility that it is the biased coin? 


The formula in (6.4) is called Bayes’ Formula. It is often combined with (6.3) as 
follows. Suppose the possible outcomes of an experiment are B\, B 2 , ..., B k . Then 


P(Bi | A) = 


_ P(A | Bi)P(Bj) _ 

P(A | B 1 )P(B l ) H-h P(A | B k )P(B k )' 


(6.5) 


(The denominator of (6.5) comes directly from (6.3).) 

When Bayes’ formula is used, very often the outcomes B\, B 2 , ..., B k are the 
outcomes of an experiment that occurred earlier, and A is an outcome, or set of 
outcomes, of a later experiment. Bayes’ formula answers the question, “given the 
outcome of the second experiment, what is the probability that the outcome of the 
first experiment was so-and-so?” This sometimes confuses students because it seems 
to suggest that the outcome of the later experiment can somehow affect the earlier 
experiment, and that is impossible—causes must come before effects. What, in fact, 
happens is that knowledge of the later experiment can increase our (incomplete) 
knowledge of the earlier experiment. 


Sample Problem 6.24. The car pool contains ten Fords (five red and five white) 
and 15 Pontiacs (five red and ten white). You are allocated a car at random. You 
see from a distance that it is red. What is the probability that you have been given 
a Ford? 


Solution. We require P(F \ R). There are 25 cars of which 10 are Fords, so 
P(F) = ^ = 0.4, and ten are red, so P(R) = 0.4 also. The probability of a red 
car, given that it is a Ford, is 0.5, since half the Fords are red. So 

P(R)P(F | R) = P(F)P(R | F), 

0.4 x P(F | R) = 0.4 x 0.5, 

P(F | R) = 0.5. 




6.5 Bayes’ Formula and Applications 205 


Practice Exercise. Box A contains two blue pens and three red pens. Box Y 
contains two red pens and three blue pens. A box is chosen at random and a pen 
is chosen at random from it. If the pen is blue, what is the probability that the 
box was box X ? 


Tree Diagrams 


It is often convenient to use tree diagrams in Bayes’ formula problems. When the 
diagram is completed, the terms to be added for the denominator of (6.5) are on the 
right-hand side. 

The computation of Bayes’ formula can be carried out as follows. 

1. First construct a tree diagram with the possible outcomes B\, # 2 , • • •, Bk as 
branches. 

2. To each of these branches add the branches A (“the event A occurs”) and 
A (“A does not occur”). The paths ending in A together represent all the cir¬ 
cumstances in which A can occur. 

3. Now calculate the probability—the product of the conditional probabilities—for 
each branch that ends in A. The sum of these probabilities will be P(A), the 
denominator of (6.5). 

4. The numerator P(B t | A) will be written next to one of the branches. 

Sample Problem 6.25. Construct a tree diagram for Sample Problem 6.23. 


Solution. In the terminology of experiments, Sample Problem 6.23 can be de¬ 
scribed as follows. First, a supplier is chosen; then a light bulb is chosen from 
that supplier. So the outcomes of the first experiment (the B\, B 2 , ...) are fac¬ 
tories X , Y, or Z. The first part of the tree diagram has three branches labeled 
X , Y, and Z, with probabilities 0.4, 0.35, and 0.25, respectively. Event A is “the 
bulb is faulty,” and the three probabilities are 0.01, 0.02, and 0.03. So we obtain 
the diagram of Figure 6.5. 

From the Figure, the denominator is 0.0185. So 


P{Z | A) = 


P(A | Z)P{Z) 
0.0185 


0.0075 

0.0185 


0.676. 


Practice Exercise. Repeat the above for Sample Problem 6.24. 


Sample Problem 6.26. The following table shows the proportion of people over 
18 who are in various age categories, together with the probability that a person 
in a given category will vote in a given election. A vote is selected at random. 
What is the probability that the voter was from the 18-24 age group? 
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V 

N 

V 
N 

V 
N 

V 

N 


P(V\A) P(A) = .089 


P(V\B) P(B) = 21A 


P(V\C ) P(C)= .178 


P(V\D) P(D) = .112 


Fig. 6.6. Tree diagram for Sample Problem 6.26 


Age 

Proportion 

Probability 

A: 18-24 

18.2% 

0.49 

B: 25^14 

38.6% 

0.71 

C: 45-64 

28.2% 

0.63 

D: 65+ 

15.2% 

0.74 


Solution. We use the diagram shown in Figure 6.6. (V and N mean “voter” and 
“nonvoter” respectively). 
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A 

B 


X 

0.004 

0.396 

0.4 

Y 

0.007 

0.343 

0.35 

Z 

0.0075 

0.2425 

0.25 


0.0185 

0.9815 



Fig. 6.7. Box diagram of data for Sample Problem 6.23 


Practice Exercise. What is the probability that the voter was 65 or older? 


Box Diagrams 

In many examples it is easiest to represent a two-stage experiment using a box di¬ 
agram. The outcomes of the first experiment are used as labels for rows, and the 
outcomes of the second experiment are used as labels for columns. Where row A 
meets column B , we put a box containing P(A Pi B). The sum of entries in row A, 
which equals P(A ), is written at the end of row A, and so on. 

As an example, consider Sample Problem 6.23. The first experiment— 
determination of factory—has outcomes X , Y, and Z; the second experiment— 
“check: is it faulty”—has outcomes A (faulty) and B (okay). Then 

P(A H X) = P(A | X)P(X) = 0.004, 

P(A H Y) = P(A | Y)P(Y) = 0.007, 

P(A nz) = P(A I Z)P(Z) = 0.0075. 

(See Figure 6.5 for these calculations.) 

We can calculate P(B D X) in various ways. For example, A and B are comple¬ 
ments, so 


P(A H X) + P(B H X) = P(X), 

p(B nx) = P(X) - P(A n x) 


= 0.4 - 0.004 = 0.396; 

similarly P(B HY) = 0.343, P(B Pi Z) = 0.2425. So the box diagram is as shown 
in Figure 6.7. 

Sample Problem 6.27. Use the box diagram in Figure 6.7 to calculate P(Z \ A) 
in Sample Problem 6.23. 

Solution. To find P(Z \ A), look at column A, which has a total of 0.0185. Then 
look at the (Z, A) entry 0.0075, and take the ratio 
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P(Z | A) = 


p(A n z) 
P(Z) 


0075 

0.0185 


0.406. 


Practice Exercise. Produce a box diagram for the Practice Exercise following 
Sample Problem 6.23 and use it to find the probability that you have tossed the 
biased coin, given that it shows heads. 


Medical Testing 

Probability theory is often misused in the press. One particular area where probabilis¬ 
tic ideas are mentioned, but not properly analyzed, is in the discussion of tests for 
disease and drugs. This is a very important topic nowadays, when compulsory drug 
testing is becoming more common in everyday life and testing for diseases such as 
AIDS is also very important. 

You may see in a newspaper that a certain test is “90% accurate.” And you would 
most likely think this means that, if you test positive, there is a 90% probability that 
you have the disease in question. But this is not so, as the next example shows. 

Sample Problem 6.28. A test for a venereal disease is 90% accurate: if you have 
the disease, the probability is 0.9 that you will test positive; and if you do not 
have the disease, the probability is 0.9 that you will test negative. In the whole 
population, 1 person in 100 has the disease. If a given person tests positive, what 
is the chance that he has the disease ? 


Solution. We shall use the abbreviations T (tests positive), N (tests negative), D 
(has the disease), and H (is healthy). We want to find P(D \ T). From the data, 
we know 

P(T | D) = 0.9, P(N | D) = 0.1, 

P(T | H) = 0.1, P(N | H) = 0.9, 

P(D) = 0.01, P(H) = 0.99. 


So 


P(D | T) = 


P(T | D)P(D) 

P(T | H)P(H) + P(T | D)P(D) 
(0.9)(0.01) 


(0.1)(0.99) + (0.9)(0.01) 
0.009 _ 0.009 

0.099 + 0.009 ~ 0.108 = 


0.083. 


So, even if you test positive, it is still most unlikely that you have the disease. 
This example can be represented by the following box diagram: 



P 

N 


D 

0.009 

0.001 

0.01 

H 

0.099 

0.891 

0.99 


0.108 

0.892 
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Practice Exercise. A disease has infected 5% of the population. The test is 95% 
effective—95% of those with the disease test positive, and 95% of those without 
the disease test negative. If your test shows a positive result, what is the proba¬ 
bility that you have the disease? 


The Monty Hall Problem 

Marilyn vos Savant’s Ask Marilyn column in Parade Magazine for September 9th, 
1990, contained a puzzle that generated a lot of interest. It is, in fact, a version of an 
older problem called the Monty Hall Problem , named for the game show host. 

The climax of a TV game show is run as follows. The contestant is given a 
choice of three numbered doors. Behind one closed door is a valuable new 
car; behind each of the others is a nearly worthless goat. The contestant is 
to choose a door, and wins the prize behind it. However, after the choice is 
announced but before the door is opened, the host opens one of the other two 
doors (not the one she chose) and reveals a goat. (Of course the host knows 
where the car is.) He then asks, “Do you want to stay with your original 
choice? Or would you rather switch to the third door?” 

Well, should the contestant stay or switch? Or does it not matter? 

Before analyzing the problem, we need to agree on three points. First, the car is 
placed behind the doors at random, so that on any given night the chance that it is 
behind any particular door is Second, the game always proceeds in the same way: 
the host always opens a door to show a goat, then offers the switch. Third, on those 
nights when the contestant’s first choice is the door with the car, there is an equal 
chance that the host will open either of the other two doors. 

Without loss of generality, let us suppose the contestant chooses door 1 and the 
host opens door 2. We write Cl, C2, and C3 as abbreviations for “the car is behind 
door 1,” “the car is behind door 2,” and “the car is behind door 3,” and H 2, H 3 for 
“the host opens door 2,” “the host opens door 3.” (He cannot open door 1.) Then 
what we want to know is 


P(C 3 | HI) > P(C 1 | H2)l 

If so, the contestant should switch, otherwise not. 

We know the probabilities of the host’s actions, given the position of the car. If 
the car is behind door 1, she is equally likely to open either door, so 

P(H2 | Cl) = P(H3 | Cl) = 

In the other cases she must open the remaining “goat” door, so 


P(H2 | C2) = P(H 3 | C3) = 0, 
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P(H2 | C3) = P(H3 | C2) = 1. 

Moreover we know F(C1) = P(C2) = P(C 3) = 

Now the calculation of the probabilities is a simple application of Bayes’ for¬ 
mula. First 


P(H 2) = P(H2\C1)P(C1) + P(H2\C2)P(C2) + P(H2\C3)P(C3) 



and similarly P(H 3) = So 


P(C 1 | #2) 


F(H2 | Cl)P(Cl) 

P(tf2) 


P(C3 | #2) 


P(H2 | C3)P(C3) 
P(H2) 


(b(b 

1 

2 

ax?) 

(b 


1 

3’ 

2 

3' 


So the odds are 2 to 1 in favor of switching. You may find this very surprising— 
intuitively, you might argue that “the car was equally likely to be behind any of the 
doors, so the probabilities are still equal,” but the host’s choice of doors has actually 
given you some information. 

The most important assumption in this discussion is that when the contestant has 
chosen correctly, the host is equally likely to open either door. Suppose this were not 
true—for example, suppose he always chooses the lowest numbered available door. 
Then if the contestant chooses door 1 and the host opens door 3, she should always 
switch; but if he opens door 2, there is no advantage (or disadvantage) in switching. 

This problem is particularly well suited to analysis by box diagrams. Assuming 
that the contestant chooses door 1, then the fact that the car is equally likely to be 
behind any door means that the diagram looks like 



Cl 

C2 

C3 


H2 

* 

* 

* 

* 

H3 

* 

* 

* 

* 


l 

1 

1 



3 

3 

3 



(where asterisks represent the numbers that have not yet been determined). Since the 
host never opens the door that hides the car, the (H 2, C2) and (H 3, C3) entries must 
be zero, and to make the column sums come out right we have 


* 

* 


Cl 


C2 


C3 


H2 


H 3 
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Assuming the host chooses at random when the car is behind door 1, we can 
make the two missing numbers in column Cl equal, and we have 



Cl 

C2 

C3 


HI 

l 

6 

0 

l 

3 

1 

2 

H3 

1 

6 

l 

3 

0 

1 

2 


1 

1 

1 



3 

3 

3 



When the host opens door 2, we need only look at the H2 row to see that the 
odds are 2 to 1 to jr) in favor of switching, and similarly if he opens door 3. 

Exercises 6.5 

1. The events E and F satisfy P(E) = 0.6, P(F \ E) = 0.5, and P(F \ E) = 
0.75. Find P(E \ F ) and P(E \ F ). 

2. The events E and F satisfy P(E) = 0.85, P(F \ E) = 0.8, and P(F \ E) = 
0.8. Find P(E \ F ) and P( E \ F ). 

3. One experiment has the possible outcomes A, B, and C; a second experiment 
has the possible outcomes E and F. Find P(A \ E ), P(B \ E ), P(C \ E ), 
P(A | F), P(B | F ), and P(C \ F) in each of the following cases: 

(i) P(A) = 0.4, P(B) = 0.4, P(E | A) = 0.25, P(E \ B) = 0.75, and 
P(E ) =0.6. 

(ii) P(A) = 0.25, P(B) = 0.25, P(£ | A) = 0.4, P(£ | B) = 0.4, and 
P(E ) =0.5. 

(iii) P(A) = ,P(B) = P(E | A) = i, P(E | fi) = 0, and P(E) = 

4. One experiment has the possible outcomes A, B, and C; a second experiment 

has the possible outcomes E and E. Find P(A \ E ), | E ), P(C | £), 

P(A | F), F(F | F), and F(C | F) in each of the following cases: 

(i) P(A) = P(B) = i, P(E | A) = l P(F | B) = l and 
P(F | C) = i. 

(ii) P(A) = 0.25, P(B) = 0.25, P(E \ A) = 0.6, P(F \ B) = 0.6, and 
P(F | C) = 0.5. 

(iii) P(A) = 0.5, P(B ) = 0.3, P(E | A) = 0.6, P(F B) = 0, and 
P(F | C) = 0.5. 

5. An automobile dealership finds that 4% of their customers default on payments, 
so that the car must be repossessed. On analyzing the records, it is found that 
among those who did not default, 40% made a large downpayment ($2000 or 
more), while only 10% of those who later defaulted made a large downpayment. 

(i) Suppose a customer makes a large downpayment. What is the probability 
that he will default on payments? 
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(ii) If a customer makes a small downpayment, what is the probability that he 
will default on payments? 

6. A builder buys tiles from two companies, A and B; he gets 80% from A and 
20% from B. He finds that 98% of the tiles he gets from A are undamaged, 
while 96% of those from B are undamaged. If he finds a damaged tile, what is 
the probability that it came 

(i) from A? 

(ii) fromB? 

7. In manufacturing metal office equipment, your company uses nuts supplied by 
three companies, A, B, and C. Company A supplies 30%, company B supplies 
45%, and company C supplies 25%. It is known that on average the following 
percentages of the nuts are defective: 1% of those from A, 1.5% of those from B, 
and 0.5% of those from C. 

(i) If a nut is selected at random, what is the probability that it is defective? 

(ii) If a nut is selected at random and is found to be defective, what is the prob¬ 
ability that it was made by company B? 

8. Suppose the shipping department of a company has three workers who prepare 
shipping labels. They prepare 60%, 30%, and 10% of the labels, respectively. 
The respective percentages of errors are 3%, 5%, and 10%. Find the probability 
that an incorrect label is due to the first person. Also find the probability that an 
incorrect label is due to each of the other persons. 

9. In a certain city, it is found that equally many people have fair and dark hair. 
A survey shows that 20% of people with dark hair and 40% of people with fair 
hair have blue eyes. A person is chosen at random from the population and is 
found to have blue eyes. What is the probability that this person has fair hair? 

10. Box 1 contains three red pens and four blue pens. Box 2 contains four red pens, 
one green pen, and three black pens. A pen is chosen from Box 1 and then it is 
placed in Box 2. Then a box is chosen at random and a pen is selected. 

(i) What is the probability that the pen is blue? 

(ii) If the pen is blue, what is the probability it originally came from Box 1? 

(Hint: Work this problem as though there were two types of blue pen: light blue 
initially in Box 1, dark blue initially in Box 2.) 

11. In Exercise 10, suppose Box 2 is the one from which a pen in finally selected. 

(i) If the pen drawn is blue, what is the probability that the pen drawn earlier 
from Box 1 was red? 

(ii) If the pen drawn is red, what is the probability that the pen drawn earlier 
from Box 1 was red? 
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12. In a manufacturing plant, three machines, A, B, and C, produce 50%, 35%, and 
15%, respectively, of the total production. The quality control department of the 
company has determined that 1% of the items produced by Machine A and 2% 
of the items produced by each of Machines B and C are defective. If an item is 
selected at random and found to be defective, what is the probability that it was 
produced by Machine B? 

13. In a factory, 30% of the workers smoke. It is found that smokers have three times 
the absentee rate of other workers. If a worker is absent, what is the probability 
that he is a smoker? 

14. An auto insurance company classifies 20% of its drivers as good risks, 60% as 
medium risks, and 20% as bad risks (called classes A, B, and C, respectively). 
The probability of at least one accident in a given year is 1% for class C, 0.5% 
for class B, and 0.1% for class A. If one of their insureds has an accident this 
year, what is the probability that he is a class B driver? 

15. A class contains 60% women. It is found that 12% of the men students and 7% 
of the women students are left-handed. A student is chosen at random. If the 
student is left-handed, what is the probability that the student is male? 

16. A school tax proposition is submitted to voters. The voters’ registered party affil¬ 
iation as a percentage of all voters, and the percentage of each group who voted 
in favor of the proposition, are as follows: 


Party 

Registration 

In Favor 

Democrat 

40 

70 

Republican 

40 

20 

Independent 

10 

80 

Other 

10 

50 


A voter is selected at random. 

(i) What is the probability that she voted in favor of the proposition? 

(ii) If she voted in favor of the proposition, what is the probability that she is a 
Democrat? 

(iii) If she voted against the proposition, what is the probability that she is not 
a Republican? 

17. Among one strain of cattle in Texas, it is estimated that 30% of the bulls suffer 
from Ruckel’s disease. A blood test has been developed to detect the disease. If 
a bull has the disease, then the test gives a positive response 80% of the time. If 
a bull does not have the disease, the test is positive in 10% of cases. If a bull is 
tested for the disease and the test is positive, what is the probability that he really 
does not have the disease? 
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18. A test is positive for 94% of subjects who have a certain disease and negative for 
96% of those who do not have the disease. If 4% of the population suffers from 
the disease, then 

(i) if the test is positive, what is the probability that the patient actually has the 
disease? 

(ii) if the test is negative, what is the probability that the patient does not have 
the disease? 

19. A smallpox vaccine produces total immunity in 95% of cases. If the vaccination 
does not produce immunity, then it has no effect—the person’s probability of 
catching smallpox is the same as that of an unvaccinated person. Suppose 30% 
of the population have been vaccinated. If a person contracts smallpox, what is 
the probability that she had been vaccinated? 

20. A laboratory test for a particular disease tests positive 95% of the time when a 
person has the disease, and tests negative 98% of the time when the person does 
not have the disease. In general 0.1% of the population has the disease. 

(i) What is the probability that a randomly selected person who tested positive 
did not have the disease? 

(ii) What is the probability that a randomly selected person who tested negative 
actually had the disease? 

21. Suppose the “Monty Hall” game involved four doors, not three. What is the 
probability of winning if you switch? What is the probability of winning if you 
do not switch? 

22. Suppose there are three cards in a hat. One is red on both sides, one black on both 
sides, and the third has one side red and one side black. One card is withdrawn, 
and one side is observed to be red. What is the chance that the other side is red? 
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Graph Theory 


The word graph is used in discrete mathematics for a mathematical object consisting 
of a set of objects together with a way in which some pairs of objects are related. So 
we could say it represents the most general type of symmetric binary relation. 

This simple structure has very widespread applications. For example, graphs can 
be used to represent road systems, airline routes, and water pipelines. They have a 
number of applications in routing and scheduling. They are important in computer 
science as models of structures (such as VLSI chips) and of processes (in the design 
of operating systems, for example) and also as data structures. 


7.1 Introduction to Graphs 

Graphs 

Any binary relation can be represented by a diagram. Suppose ~ is a binary relation 
on the set S. The elements of S are shown as points (called vertices ), and if v ~ y is 
true, then a line (called an edge) is shown from x to y, with its direction indicated by 
an arrow. Provided the set S is finite, all information about any binary relation on S 
can be shown in the diagram. 

The diagram of a binary relation on a finite set is called a directed graph or 
digraph. If v ~ x is true for at least one vertex x, the diagram is a looped digraph; if 
v ~ v never occurs (~ is antireflexive), then the digraph is called simple. 

In the case where the relation is symmetric, the arrows can be omitted from its 
diagram. The diagram of a symmetric, antireflexive binary relation on a finite set is 
called a graph. The phrase “looped graph” can be used to describe the diagram of 
a symmetric relation in which x ~ x is sometimes true, but be careful as a “looped 
graph” is not really a graph! Similarly, “infinite graph” means an object defined like 
a graph in which the vertex set can be infinite. 


W.D. Wallis, A Beginner's Guide to Discrete Mathematics , 

DOI 10.1007/978-0-8176-8286-6_7, © Springer Science+Business Media, LLC 2012 
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O 


z 


A 


B 


C 


Fig. 7.1. Graphical representations of three relations 


Figure 7.1 shows examples of these representations, illustrating relations be¬ 
tween three objects, x, y, and z- Example A is the digraph representing the case 
where the true relations are x ~ y, y ~ z, and z ~ y. Example B is a graph; x ~ y, 
y ~ x, y ~ z, and z ~ y are all true. In C, jc ~ y, y ~ x, and y ~ y the result is a 
looped graph. 

Two vertices are either related or not, so the diagram of a graph can never con¬ 
tain two lines joining the same pair of vertices. However, there are some applications 
where two edges between the same vertices might make sense. (For example, see the 
applications to road systems in the next section.) For this reason we sometimes talk 
about multigraphs in which there can be several edges joining the same pair of ver¬ 
tices; those edges are called multiple edges. (Similarly, one can have multiple edges 
in the same direction in a directed structure; the somewhat clumsy word “multidi¬ 
graph” has been used.) When discussing multigraphs, the word “simple” means that 
neither loops nor multiple edges occur. 

Sample Problem 7.1. The binary relations <, =, and ~ are defined on the 

set S = {1, 2, 3, 4} as follows: 

(i) v < y means xis less than y; 

(ii) jc = y means x is congruent to y(mod 2) and x^y; 

(iii) jc ~ y means x = y d= 1; 

(iv) x ~ y means y = x 2 . 

Draw the diagrams associated with <, =, and Which are graphs or simple 

digraphs? 

Solution. The diagrams are shown below. Relations = and ~ yield graphs, 

< gives a simple digraph, and ~ a looped digraph. 



4a 


1 c> 



(i) 


(ii) 


(iii) 


(iv) 


Practice Exercise. The binary relations =, and ~ are defined on S by 
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(i) x = y means x + y = 4, 

(ii) x ~ y means x = y + 1, 

(iii) x ~ y means y / x. 

Draw the diagrams associated with =, and Which are graphs or simple 

digraphs? 

We make a formal definition and set up some notation as follows: a graph G 
consists of a finite set V of objects called vertices , together with a symmetric, an¬ 
tireflexive binary relation called adjacency , written Associated with G is a set E 
of unordered pairs of elements of V , called edges. {x,y} is an edge if and only if 
v ~ y. Vertices x and y are called the endpoints of the edge {x, y}, and we say this 
edge joins x to y. If a graph G is under discussion, we usually write V(G) to mean 
the set of vertices of G and E(G) for its set of edges. 

In the diagram representing a graph, an edge {x, y} is shown as a line from x 
to y. This is usually drawn as a straight line segment, but not always; the only im¬ 
portant thing is that it joins the correct points. Moreover, the position of the points 
is not fixed. For these two reasons, one graph can give rise to several drawings that 
look quite dissimilar. For example, the three diagrams in Figure 7.2 all represent the 
same graph. Although the two diagonal lines cross in the first picture, their point of 
intersection does not represent a vertex of the graph. 


Some Special Graphs 

If G is a graph, it is possible to choose some of the vertices and some of the edges 
of G in such a way that these vertices and edges again form a graph, say H. H is 
then called a subgraph of G; one writes H < G. Clearly every graph G has itself as 
a subgraph; we say a subgraph H is a proper subgraph of G, and write H < G, if it 
does not equal G. 

Given a set S of v vertices, the graph formed by joining each pair of vertices in 
S is called the complete graph on S and denoted Ks. We also write K v to mean any 
complete graph with v vertices. The three drawings in Figure 7.2 are all representa¬ 
tions of K\. 

Given any graph G, the set of all edges of Ky(G ) that are not edges of G will 
form a graph with V (G) as its vertex set; this new graph is called the complement 
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Fig. 7.3. A graph and its complement 



Fig. 7.4. K 3A 


of G and written G. An example is shown in Figure 7.3. More generally, if G is a 
subgraph of H, then the graph formed by deleting all edges of G from H is called 
the complement of G in H, denoted H — G. The complement Ks of the complete 
graph Ks on vertex set S is called an empty graph ; we also write K v for an empty 
graph with v vertices. 

The complete bipartite graph on V\ and V 2 has two disjoint sets of vertices, V\ 
and V 2 ; two vertices are adjacent if and only if they lie in different sets. We write 
K m , n to mean a complete bipartite graph with m vertices in one set and n in the 
other. Figure 7.4 shows K 2A \ K\, n in particular is called an n-star. Any subgraph of 
a complete bipartite graph is called bipartite. 

A graph is called disconnected if its vertex set can be partitioned into two sub¬ 
sets, V\ and V 2 , that have no common element, in such a way that there is no edge 
with one endpoint in V\ and the other in V 2 . If a graph is not disconnected, then 
it is connected. A disconnected graph consists of a number of disjoint subgraphs; 
a maximal connected subgraph is called a component. 

The complement of the complete bipartite graph K m , n is a disconnected graph. 
It consists of two components, a K m and a K n . This graph is denoted K m U K n . 
More generally, if G and H are any two graphs, the graph formed by putting the two 
together is called the union of G and H, and is written G U H. When G and H have 
no common vertex, we say it is a disjoint union. So we can say, “the complement of 
K m , n is the disjoint union of a K m and a K n r 


Degree 

We define the degree or valency d(x) of the vertex x to be the number of edges that 
have x as an endpoint. If d(x) = 0, then x is called an isolated vertex, while a vertex 
of degree 1 is called a pendant vertex. The edge incident with a pendant vertex is 




7.1 Introduction to Graphs 219 



Fig. 7.5. What are the degrees? 


called a pendant edge. A graph is called regular if all its vertices have the same 
degree. If the common degree is r, it is called r-regular. In particular, a three-regular 
graph is called cubic. We write 8 (G) for the smallest of all degrees of vertices of G, 
and A(G) for the largest. (One also writes either A(G) or 8 (G) for the common 
degree of a regular graph G.) 

Sample Problem 7.2. What are the degrees of the vertices in graphs (ii) and (iii) 
of Sample Problem 7.1 ? What are the degrees of the vertices of 4 ? 

Solution. In graph (ii), every vertex has degree 2. In graph (iii), vertices 1 and 4 
have degree 1, while 2 and 3 have degree 2. K 3 4 has three vertices of degree 4 
(the upper vertices in Figure 7.4), while the other four have degree 3. 

Practice Exercise. What are the degrees of the vertices on the graphs in Fig¬ 
ure 7.5? 


Theorem 43. In any graph or multigraph, the sum of the degrees of the vertices 
equals twice the number of edges (and consequently the sum of the degrees is an 
even integer). 

Proof. Suppose the graph or multigraph has e edges; label the edges, say yi, y 2 , 
..., y e . Consider a list in which each edge appears twice, once for each endpoint. 
For example, if y\ has endpoints X 4 and xj, you might make entries y\ : X 4 and 
yi : JC 7 . Vertex v will appear in precisely d(x) entries, so the total number of entries 
equals the sum of the degrees of the vertices. On the other hand, each edge appears 
twice, so the total number of entries equals twice the number of edges. □ 

Corollary 1. In any graph or multigraph, the number of vertices of odd degree is 
even. In particular, a regular graph of odd degree has an even number of vertices. 

A collection of v nonnegative integers is called graphical if and only if there is 
a graph on v vertices whose degrees are the members of the collection. A graphical 
collection is called valid if and only if there is a connected graph with those degrees. 

Sample Problem 7.3. Which of the following collections are graphical? 


3, 3, 2, 2,0 4,3, 1,0 3, 2, 2, 2, 2 4, 3, 3, 3, 2, 2, 1 4,4, 3,2, 1. 
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Solution. The first and fourth are graphical; see (i) and (ii). 


3 2 3 4 2 2 



3 2 0 3 3 1 

(i) (ii) 


(Notice the isolated vertex, corresponding to degree 0.) 4, 3, 1, 0 is impossible 
because there are only four vertices, so no vertex can have degree greater than 3. 
The collection 3, 2, 2, 2, 2 is impossible because the sum of the degrees is odd. 
And 4, 4, 3, 2, 1 is impossible. If there are five vertices and two of them have 
degree 4, the other vertices must all be adjacent to those two; certainly no vertex 
can have degree smaller than 2 . 

Practice Exercise. Which of the following collections are graphical? 

2,2,2, 1, 1 5,3,3, 1, 1, 1 4, 3,2,2, 1, 1, 1. 

This is one situation where the distinction between graphs and multigraphs is 
important. For example, even though 4, 3, 1, 0 is not graphical, there is a multig raph 
with those degrees. 


4 3 
i o o o 


Exercises 7.1 

In Exercises 1 to 8 , draw the diagram of the binary relation defined on the set 
(—2, —1,0, \,2}. Is it simple? Is it directed? 

1 . x ~ y means x + y <2. 

2 . x ~ y means x + y < 6 . 

3. v ~ y means x = y + 1. 

4. v ~ y means x = dry. 

5. v ~ y means x < y 2 . 

6. x ~ y means x + y > 0 . 

7. x ~ y means x + y is odd. 

8 . v ~ y means vy is odd. 














7.1 Introduction to Graphs 221 


In Exercises 9 to 15, ~ is defined on the positive integers. Is the corresponding dia¬ 
gram simple? Is it directed? 

9. x ~ y means x + y is odd. 

10 . x ~ y means x divides y. 

11 . x ~ y means x and y have greatest common divisor 1. 

12. x ~ y means x + y < 4. 

13. x ~ y means x and y are both prime numbers. 

14. x ~ y means x = dry. 

15. x ~ y means xy is odd. 

16. What are the degrees of the vertices in these graphs? 


(i) (ii) (iii) (iv) 

17. What are the degrees of the vertices in these graphs? 

& W 

(i) (ii) (iii) (iv) 

18. Suppose G has v vertices and 8 (G) > Prove that G is connected. 

19. How many edges does the star K\, n have? What are the degrees of its vertices? 

20. The n-wheel W n has n + 1 vertices {xo, x\, ..., x n }\ xo is joined to every other 
vertex and the other edges are 

XiX2,X2X3, . . . ,X n -iX n ,X n Xi. 

How many edges does W n have? What are the degrees of its vertices? 

21. Prove that the collection {3, 3, 2, 1, 1} is graphical and valid. Find a connected 
graph with this collection of degrees. 

22. Prove that the collection {3, 2, 2, 2, 1} is graphical and valid. Find a connected 
graph with this collection of degrees. 






In Exercises 23 to 30, is the collection graphical? If it is graphical, is it valid? 

23. {5, 5, 4, 4, 2, 2}. 24. {3, 3, 2, 2, 2, 1}. 
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25. {5,4,4, 3,3,3}. 
27. {5, 4, 4, 4, 3, 3}. 

29. {2,2, 1, 1, 1, 1}. 


26. {4, 3, 2, 2, 2, 1}. 
28. {5, 4, 4, 4, 2, 1}. 

30. {3, 3, 2, 2, 1, 1}. 


31. Prove that if G is not a connected graph, then G is connected. 

32. Prove that a regular graph of odd degree can have no component with an odd 
number of vertices. 

33. Prove that no graph has all its vertices of different degrees. 


7.2 The Konigsberg Bridges; Traversability 

Modeling Road Systems 

Suppose we want to describe the roads joining various towns to plan an itinerary. 
If our only desire is to list the towns that will be visited in order, we do not need 
to know the physical properties of the region, such as hills, whether different roads 
cross, or whether there are overpasses. The important information is whether or not 
there is a road joining a given pair of towns. In these cases we can use a complete 
road map, with the exact shapes of the roads and various other details shown, but 
it will be less confusing to make a diagram as shown in Figure 7.6(a) that indicates 
roads joining B to C, A to each of B and C, and C to D, with no direct roads joining 
A to D or B to D. The diagram is obviously a graph whose vertices are the towns 
and whose edges are the road links. 

In some cases there is a choice of routes between two towns. This information 
may be important: you might prefer the freeway, which is faster, or you might prefer 
the more scenic coast road. In this case the graph does not contain enough informa¬ 
tion. For example, if there had been two different ways to travel from B to C, this 
information can be represented as in the diagram of Figure 7.6(b). This new diagram 
is a multi graph', that is, there is a multiple edge (of multiplicity 2) joining B to C. 

Sample Problem 7.4. Suppose the road system connecting towns A, B, and C 
consists of two roads from A to B, one road from B to C, and a bypass road 
directly from A to C. Represent this road system graphically. 


Solution. 






7.2 The Konigsberg Bridges; Traversability 223 




B 


D 


Fig. 7.6. Representing a road system 



Fig. 7.7. The Konigsberg bridges 


Practice Exercise. Represent graphically the road system connecting towns A, 
B , C, D , with one road from A to B, one from A to C, two from A to D, and one 
from B to C. 


The Konigsberg Bridges 

The first mathematical paper on graph theory was published by the great Swiss math¬ 
ematician Leonhard Euler in 1736, and had been delivered by him to the St. Peters¬ 
burg Academy one year earlier. 

Euler’s paper grew out of a famous old problem. The town of Konigsberg in 
Prussia is built on the river Pregel. The river divides the town into four parts, in¬ 
cluding an island called The Kneiphof, and in the eighteenth century the town had 
seven bridges; the layout is shown in Figure 7.7. The question under discussion was 
whether it was possible from any point on Konigsberg to take a walk in such a way 
as to cross each bridge exactly once. 

Euler set for himself the more general problem as follows: given any configura¬ 
tion of river, islands, and bridges, find a general rule for deciding whether there is a 
walk that covers each bridge precisely once. 

We first show that it is impossible to walk over the bridges of Konigsberg. Sup¬ 
pose there was such a walk. There are three bridges leading to the area C : you can 
traverse two of these, one leading into C and the other leading out, at one time in 
your tour. There is only one bridge left: if you cross it going into C, then you cannot 
leave C again, unless you use one of the bridges twice, so C must be the finish of the 
walk; if you cross it in the other direction, C must be the start of the walk. In either 
event, C is either the place where you start or the place where you finish. 
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Fig. 7.8. The multigraph representing the Konigsberg bridges 


A similar analysis can be applied to A, B, and D since each has an odd number 
of bridges—five for B and three for the others. But any walk starts at one place and 
finishes at one place; either there can be two endpoints, or the walk can start and 
finish at the same place. Therefore it is impossible for A, B, C, and D all to be either 
the start or the finish. 

Euler started by finding a multigraph that models the Konigsberg bridge problem 
(considered as a road network, with the islands and river banks as separate “towns”). 
Vertices A, B , C, and D represent the parts A, B, C, and D of the town, and each 
bridge is represented by an edge. The multigraph is shown in Figure 7.8. In terms of 
this model, the original problem becomes can a simple walk be found that contains 
every edge of the multigraph? 

These ideas can be applied to more general configurations of bridges and islands, 
and to other problems. A simple walk that contains every edge of a given multigraph 
will be called an Euler walk in the multigraph. Finding whether a given multigraph 
(or a given road network) has an Euler walk is called the traversability problem. 

Sample Problem 7.5. The islands A, B, C, D are joined by seven bridges—two 
joining A to B, two from C to D, and one each joining the pairs AC, AD, 
and BD. Represent the system graphically. Can one walk through this system, 
crossing each bridge exactly once ? 

Solution. Islands B and D each have an odd of 
bridges, so one must be the start of the walk and 
the other the finish. With a little experimenta¬ 
tion you will find a solution—one example is 
B AC DAB DC and there are others. 


Practice Exercise. The islands X, Y, Z, T are joined by seven bridges—three 
joining A to Y, and one each joining the pairs XZ , XT, and YZ. Represent 
the system graphically. Can one walk through this system, crossing each bridge 
exactly once? 

In proving that a solution to the Konigsberg bridge problem is impossible, we 
used the fact that certain vertices had an odd number of edges incident with them. 
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(The precise number was not important; oddness was the significant feature.) Let us 
call a vertex even if its degree is even, and odd otherwise. It was observed that an 
odd vertex must be either the first or the last point in the walk. In fact, if a multigraph 
is traversable, then either the multigraph has two odd vertices, the start and finish of 
the Euler walk, or the multigraph has no odd vertices, and the Euler walk starts and 
finishes at the same point. Another obvious necessary condition is that the multigraph 
must be connected. These two conditions are together sufficient. 

Theorem 44. If a connected multigraph has no odd vertices, then it has a Euler walk 
starting from any given point and finishing at that point. If a connected multigraph 
has two odd vertices, then it has a Euler walk whose start and finish are the odd 
vertices. 

(A closed Euler walk—one that starts and ends at the same point—is called a 
Euler circuit.) 

Proof. Consider any simple walk in a multigraph that starts and finishes at the same 
vertex. If one erases every edge in that walk, one deletes two edges touching any 
vertex that was crossed once in the walk, four edges touching any vertex that was 
crossed twice, and so on. (For this purpose, count “start” and “finish” combined as 
one crossing.) In every case an even number of edges is deleted. 

First, consider a multigraph with no odd vertex. Select any vertex x, and select 
any edge incident with x. Go along this edge to its other endpoint, say y. Then 
choose any other edge incident with y. In general, on arriving at a vertex, select any 
edge incident with it that has not yet been used, and go along the edge to its other 
endpoint. At the moment when this walk has led into the vertex z, where z is not x, 
an odd number of edges touching z has been used up (the last edge to be followed, 
and an even number previously). Since z is even, there is at least one edge incident 
with z that is still available. Therefore, if the walk is continued until a further edge is 
impossible, the last vertex must be x —that is, the walk is closed. It will necessarily 
be a simple walk and it must contain every edge incident with x. 

Now assume that a connected multigraph with every vertex even is given, and a 
simple closed walk has been found in it by the method just described. Delete all the 
edges in the walk, forming a new multigraph. From the first paragraph of the proof 
it follows that every vertex of the new multigraph is even. It may be that we have 
erased every edge in the original multigraph; in that case we have already found 
a Euler walk. If there are edges still left, there must be at least one vertex, say c, 
that was in the original walk and that is still on an edge in the new multigraph—if 
there were no such vertex, then there can be no connection between the edges of the 
walk and the edges left in the new multigraph, and the original multigraph must have 
been disconnected. Select such a vertex c, and find a closed simple walk starting 
from c. Then unite the two walks as follows: at one place where the original walk 
contained c, insert the new walk. For example, if the two walks are 


x, y,..., z, c, u,..., x, 
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C 

F 

I 

L 


and 


c, s, ..., t, c, 


then the resulting walk will be 

x,y, ..., z, c, s, ... ,t, c,u, ..., x. 

(There may be more than one possible answer if c occurred more than once in the 
first walk. Any of the possibilities may be chosen.) The new walk is a closed simple 
walk in the original multigraph. Repeat the process of deletion, this time deleting the 
newly formed walk. Continue in this way. Each walk contains more edges than the 
preceding one, so the process cannot go on indefinitely. It must stop. But this will 
only happen when one of the walks contains all edges of the original multigraph, and 
that walk is a Euler walk. 

Finally, consider the case where there are two odd vertices p and q and every 
other vertex is even. Form a new multigraph by adding an edge pq to the original. 
This new multigraph has every vertex even. Find a closed Euler walk in it, choosing 
p as the first vertex and the new edge pq as the first edge. Then delete this first edge; 
the result is a Euler walk from q to p. □ 

It is clear that loops make no difference as to whether or not a graph has a Euler 
walk. If there is a loop at vertex x, it can be added to a walk at some traversing of v. 

A good application of Euler walks is planning the route of a highway inspector or 
mail contractor who must travel over all the roads in a highway system. Suppose the 
system is represented as a multigraph G, as was done earlier. Then the most efficient 
route will correspond to a Euler walk in G. 

Sample Problem 7.6. Find a Euler walk in the road network represented by Fig¬ 
ure 7.9(a,b). 

Solution. In the first example, starting from A, we find the walk AC BE A. 

When these edges are deleted (see Figure 7.10(a)) there are no edges remain¬ 
ing through A. We choose C, a vertex from the first walk that still has edges 
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adjacent to it, and trace the walk CFIHGDEC, after which there are no edges 
available at C (see Figure 7.10(a)). E is available, yielding walk EG JLIE. As 
is clear from Figure 7.10(c), the remaining edges form a walk HJKLH. 

We start with AC BE A. We replace C by CFIHGDEC , yielding 

ACEIHGDECBEA , 

then replace the first E by EG J LIE, with result 

ACEIHGDEGJ LIECBEA 

(we could equally well replace the second E). Finally H is replaced by HJKLH, 
and the Euler walk is 

ACE IHJKLHGDEGJ LIECBEA. 

In the second example, there are two odd vertices, namely B and E , so we add 
another edge B E and make it the first edge used. The first walk found was 

BEHGCABEDBCEB , 

and the second is DEGD, exhausting all the vertices and producing the walk 
BEHGCABEDEGDBCEB. 

The first edge (the new one we added) is now deleted, giving Euler walk 
EHGCABEDEGDBCEB , 


in the original graph. 

Practice Exercise. Find a Euler walk in the road network represented by Fig¬ 
ure 7.9(c). 
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Fig. 7.11. Find Etherizations 



Fig. 7.12. Finding an Eulerization 


Eulerization 

If G contains no Euler walk, the highway inspector must repeat some edges of the 
graph to return to his starting point. Let us define a Eulerization of G to be a multi¬ 
graph, with a closed Euler walk, that is formed from G by duplicating some edges. 
A good Eulerization is one that contains the minimum number of new edges, and 
this minimum number is the Eulerization number eu(G) of G. For example, if two 
adjacent vertices have an odd degree, you can add a further edge joining them. This 
will mean that the inspector must travel the road between them twice. However, if the 
two vertices are not adjacent, one new edge will not suffice—it would be the same 
as requiring that a new road be built! 

Sample Problem 7.7. Consider the multigraph G shown in Figure 7.1 1(a). What 
is eu(G)? Find an Eulerization of the road network represented by G that uses 
the minimum number of edges. 

Solution. Look at the representation of the multigraph on the left of Figure 7.12. 
The black vertices have an odd degree, so they need additional edges. As there 
are four black vertices, at least two new edges are needed; but obviously no two 
edges will suffice. However, there are solutions with three added edges—two 
examples are shown in Figure 7.12 —so eu(G) = 3. 

Practice Exercise. Consider the multigraph H shown in Figure 7.11(b). What 
is eu(H)l Find an Eulerization of the road network represented by H that uses 
the minimum number of edges. 
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Exercises 7.2 

In Exercises 1 to 16, decide whether the graph shown contains an Euler walk. If the 
graph contains a Euler walk, find one. Is it a Euler circuit? 

1 . 2 . 



7 . 


8 . 
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In Exercises 11 to 26, state the Eulerization number of the graph and find a good 
Eulerization. 


17 . 


18 . 























7.2 The Konigsberg Bridges; Traversability 231 



27. Suppose G is a connected graph with k vertices of odd degree. Show that G can 
be decomposed into k/2 edge-disjoint simple walks. 

28. Find a graph G such that both G and its complement G have Euler circuits. 
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Fig. 7.13. P 5 and C 5 


7.3 Walks, Paths, and Cycles 


To generalize the idea of a Euler walk, we shall define a walk in a graph G to be any 
finite sequence of vertices xo, x \,..., x n and edges a \, a 2 ,..., a n of G: 

XO 5 ft 1 5 Xi , a 2 , . . . , ft 72, X 72 , 

where the endpoints of ft* are Xj_i and x; for each i. The length of a walk is its 
number of edges. The walk shown above might be called an xo x n -walk of length n. 
A closed walk, sometimes called a circuit , is one in which the first and last vertices 
are the same. 

Walks are called simple if there is no repeated edge. So a Euler walk is a simple 
walk that contains every edge. A walk in which no vertex is repeated is a path. 

There is no such thing as a closed path—the last vertex is a repeat of the first— 
but we shall define a cycle to be a closed simple walk with no vertex repeats except 
for the endpoints. 

We also use the word “path” for a graph that consists of a single ft-vertex path, 
and write P n for such a graph. A “cycle” C n is defined similarly to mean a graph 
consisting of an ft-vertex cycle. If multiple edges are allowed, C 2 will mean a multi¬ 
graph with two vertices joined by a double edge, and if loops are allowed then C\ 
usually means a single vertex with a loop, but for graphs C n is only defined when 
ft > 3. C n is a cycle of length ft, while P n is a path of length n — 1. 

Figure 7.13 shows P 5 and C 5 . 

Suppose there is a walk from vertex x to vertex y in a graph. Maybe there are 
repeated vertices in the walk. But suppose z is a repeated vertex. You could follow 
the walk from x up to the first time you come to z, then go from the last occurrence 
of z along the walk to y. In this way you replace the walk by one in which z is not 
a repeated vertex. This can be done until every repeat has been eliminated. If x and 
y are different, the end result is a path from x to y. (If x = y, all the edges are 
eliminated.) 

Theorem 45. If there is a walk from vertex x to vertex y in the graph G, where x is 
not equal to y, then there is a path in G whose first vertex is x and whose last vertex 
is y. 



7.3 Walks, Paths, and Cycles 233 


Connectedness 

We shall say that two vertices are connected when there is a walk joining them. (The¬ 
orem 45 tells us we can replace the word “walk” by “path”) Two vertices of G are 
connected if and only if they lie in the same component of G; G is a connected graph 
if and only if all pairs of its vertices are connected. (This definition of “connected 
graph” is consistent with the one given in Section 7.1.) 

Among connected graphs, some are connected so slightly that the removal of a 
single vertex or edge will disconnect them. Such vertices and edges are quite impor¬ 
tant. A vertex v is called a cutpoint in G if G — x contains more components than 
does G; in particular, if G is connected, then a cutpoint is a vertex v such that G — x 
is disconnected. Similarly a bridge (or cut-edge) is an edge whose deletion increases 
the number of components. 

Sample Problem 7.8. If a graph represents a road network, what is the interpre¬ 
tation of a bridge? Why are such edges important? 

Solution. A bridge represents a road whose deletion would cut people in part of 
the network off from those in another part. (Often a real bridge, over a river, etc. 
plays this role.) It is important to recognize such roads so that extra effort can be 
taken to keep them open to traffic. For example, the highways department might 
give special priority to keeping them plowed in winter. 

Practice Exercise. In such a graph, what is the interpretation of a cutpoint? 


Connectivity 

Generalizing the idea of a cutpoint, we define the connectivity /c(G) of a graph G to 
be the smallest number of vertices whose removal from G results in either a discon¬ 
nected graph or a single vertex. (The latter special case is included to avoid prob¬ 
lems when discussing complete graphs.) If tc(G) > k , then G is called /^-connected. 
The edge-connectivity k'(G) is defined to be the minimum number of edges whose 
removal disconnects G (no special case is needed). In other words, the edge connec¬ 
tivity of G equals the size of the smallest cutset in G. From the definition, it is clear 
that the connectivity and edge connectivity of a graph is at least as great as that of 
any of its subgraphs. 

Sample Problem 7.9. What are k and k' for the graphs A, B in Figure 7.14? 

Solution. A has no cutpoint or bridge, so k(A) > 2 and ic'(A) > 2. Consider 
the leftmost vertex: deleting the two vertices adjacent to it, or the two edges 
containing it, will disconnect A. So k(A) = k'(A) = 2. The center vertex of 
B is a cutpoint, so k(B) = 1, but there is no bridge; however, the graph can be 
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A B CD 

Fig. 7.14. Examples for connectivity calculations 


disconnected by removing the two edges joining the center vertex from the left, 
so k'(B) = 2. 

Practice Exercise. Evaluate k and k' for the graphs C, D in Figure 7.14. 

The following theorem was proven by Whitney in 1932. Recall that 8(G) denotes 
the minimum degree of vertices of G. 

Theorem 46. For any graph G, 

k(G) < k\G ) < 8(G). 

The proof is beyond our scope, but the easiest part is set as an exercise. 

Sample Problem 7.10. Calculate 8 and verify Theorem 46 for the graphs A, B 
in Figure 7.14. 

Solution. A has minimum degree 2, so k(A), k'(A), 8(A) = 2, 2, 2. B has min¬ 
imum degree 3, so k(B), k'(B), 8(B) = 1, 2, 3. 

Practice Exercise. Do the same calculation for the graphs C, D. 

It is easy to see that all combinations of strictness are possible in Theorem 46: 
both of the inequalities can be strict, or one of them, or neither. For example, graphs 
A and B in Figure 7.14 illustrate the possibilities of k , k' and 8 being all equal or all 
different. 


Exercises 7.3 

1. Find all paths from v to t in the graph shown below. 


s a b c 



d 


e 


f t 
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2. Here is the Petersen graph , which arises in several contexts in the study of 
graphs. 



Find cycles of lengths 5, 6 , 8 , and 9 in this graph. 

3. A graph contains at least two vertices. There is exactly one vertex of degree 1, 
and every other vertex has degree 2 or greater. Prove that the graph contains a 
cycle. Is this always true if we allowed graphs to have infinite vertex sets? 

4. Find all bridges in the following graphs. 



5. What are the connectivity and edge connectivity of the graphs in Exercise 4? 
In Exercises 6 to 14, find the connectivity and edge connectivity of the given graph. 

6. 7. 



10 . 


o—o o—o o—o 

o—<>— 
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15. What are the connectivity and edge connectivity of the ^-vertex cycle C n l 

In Exercises 16 to 19, find examples {other than the examples in Figure 7.14) of 
graphs with the specified relationship between minimum degree, connectivity, and 
edge connectivity. 

16. k(G) = k'(G) = 8(G) . 

17. k(G) < k'(G ) = 8(G). 

18. k(G) = k'(G) < 8(G). 

19. k(G) < k'(G) < 8(G). 

20. Show that k'(G) < 8(G) in any graph G. 


7.4 Distances and Shortest Paths 

Distance 

If vertices x and y are adjacent, then there is a path of length 1 from x to y, and we 
shall say that the distance from x to y is 1. If x and y are connected, there may be 
several paths joining them, and we define the distance between them to be the length 
of the shortest path joining them; we denote it D(x, y). For completeness, we define 
D(x, x) to be 0. 

Suppose jc, y, and z are vertices in a connected graph. One could travel from x 
to y by first going from x to z to y (distance D(x, z)), then from x to z to y (distance 
D(z, y)), a total of D(x, z) + D(z, y). There may be a shorter path from x to y, but 
in any event 
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D(x,y) < D(x,z) + D(z,y), 
for any vertices x, y, and z in a connected graph. 

Sample Problem 7.11. Find the distance between every pair of vertices in the 
graph (a) below. 



(a) (b) 


Solution. There are two different kinds of vertices in the graph. In each part of 
the following figure one vertex is highlighted and the number at a vertex shows 
the distance from the highlighted one. 


0 1 3 4 1 0 2 3 



1 2 2 3 2 1 1 2 

Practice Exercise. Find the distance between every pair of vertices in the graph 
of graph (b). 

Distance arises in the example of a communications network. The vertices of a 
graph might represent computers. If two computers are directly linked, the corre¬ 
sponding vertices are adjacent so that edges represent direct links. Often messages 
from one computer to another are sent by way of another machine—e-mail mes¬ 
sages are often relayed through several intermediate computers. Two computers can 
communicate if the corresponding vertices are connected. 

The physical separation between the computers is irrelevant. The number of 
times the signal is relayed may well be significant: the more relays, the greater the 
chance of a delay. (Direct communication between computers, by media such as a 
telephone cable, is practically instantaneous, but delays may occur when an inter¬ 
mediate link is busy or down.) So the distance between vertices is a measure of the 
possibility of delay. 

In this example, we are often interested in the worst case —what is the worst 
delay that can occur? To measure this, we define the eccentricity e(x) of vertex x 
in a connected graph to be the largest value of D(x, y), where y ranges through all 
the vertices. The diameter D = D(G ) of G is the maximum value of e(x) for all 
vertices x of G, while the radius R = R(G) is the smallest value of £(x). 

Sample Problem 7.12. Find the diameter and radius of graph (a) of Sample 
Problem 7.11, the Petersen graph (see Exercise 2 in Section 7.3), and the cy¬ 
cle C2n- 
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Solution. In the first graph, four vertices have eccentricity 4 and the others have 
eccentricity 3. So D = 4, R = 3. Every vertex of the Petersen graph has three 
neighbors and is distance 2 from all other vertices, so D = R = 2. In C2 n , every 
vertex is distance n from its opposite vertex, and the distance is smaller for all 
other vertices, so D = R = n. 

Practice Exercise. Find the diameter and radius of the graph (b) of Sample 
Problem 7.11, the wheel W n (defined in Exercise 7.1.20), and the cycle C 2 «+i. 

Theorem 47. In any connected graph G, 

R(G ) < D(G) < 2R(G). 

Proof. Clearly D(G) equals the maximum distance between any two vertices in G. 
Say x and y attain this maximum distance, so that D(x, y) = D(G ), and say z is a 
vertex for which e(z) = R(G). As R is a distance between two vertices, necessarily 
R < D. But by definition D(z, t) < e(z) = R(G) for every vertex t, so D(G) = 
D(x, y) < D(x, z ) + D(z, y) < 2R(G). □ 


Weighted Distance 

In many applications it is appropriate to define a positive function called a weight 
w(x,y) associated with each edge (x,y). For example, if a graph represents a road 
system, a common weight is the length of the corresponding stretch of road. Weights 
also often represent costs or durations. A graph with such a function defined is called 
a weighted graph. The weight of a path P is the sum of the weights of the edges in P . 
Similarly, we could define the weights of walks, cycles, subgraphs, and graphs. 

The routes traveled by an airline can conveniently be shown in a graph, with ver¬ 
tices representing cities and edges representing services. A wide range of different 
weights might be used: the distance, the flying time, and the airfare are all possibili¬ 
ties. 

If x and y are connected vertices, then the weighted distance from x to y is the 
minimum among the weights of all the paths from x to y and is denoted W(x,y). 
Weighted versions of eccentricity, radius, and diameter are defined in the obvious 
way. 


Shortest Paths 

In many applications it is desirable to know the path of least weight between two ver¬ 
tices. This is usually called the shortest path problem, primarily because a common 
application is one in which weights represent physical distances. We shall describe 
an algorithm due to Dijkstra that finds the shortest path from vertex s to vertex t 
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in a finite connected graph G. Informally stated, the algorithm arranges the vertices 
of G in order of increasing weighted distance from s. This algorithm is most easily 
described if w is defined for all pairs of vertices, so we write w(x, y) = oo if v and 
y are not adjacent. (In a computer implementation, oo can be replaced by some very 
large number.) 

The algorithm actually orders the vertices of G as so(= s), s\, S 2 , ... so that the 
weighted distances W(s, s\), W(s, S 2 ), ... are in nondecreasing order. To do so, it 
attaches a temporary label £(x) to each member x of V (G). i (x) is an upper bound 
on the weighted length of the shortest path from s to x . To start the algorithm, write 
so = s and So = {s}. Define £(so) = 0 and for every other vertex y,£(y) = 00 . Call 
this step 0. After step k the set 


Sk = {^ 0 , s 1 ,..., Sk], 

has been defined. In the next step, for each x not in Sk, the algorithm changes the 
value £(x) to the weighted length of the shortest (s,x) path that has only one edge 
not in Sk, if this is an improvement. Then ^+1 is chosen to be a vertex x such that 
t(x) is minimized. (In practice, it does not matter whether the new value of £(x) is 
used or the old value is retained for those vertices x other than Sk+i- Moreover, for 
each member S[ of Sk, it is only necessary to consider one new vertex, a vertex x 
such that w(si , x) is minimal.) 

• soiss; 

• s 1 is the vertex x for which w(s, x) is smallest, the vertex “closest” to s\ 

• to find ^2 one selects, for every vertex x other than s or si, either the lowest 
weight edge sv or the lowest weight path ss\x, whichever is shorter, and then 
chooses the shortest of all these to be S 2 ; 

and so on. After a vertex has been labeled Sj, its £■-value never changes, and this 
final value of £(x) equals W(s, x). Eventually the process must stop, because a new 
vertex is added at each step and the vertex set is finite. So W(s, t) must eventually be 
found. Finally, whenever a vertex is labeled ^+ 1 , one can define a unique vertex si 
that “preceded” it in the process—the vertex before ^+1 in the ssk+i path that had 
length l(sk+i). By working back from t to the vertex before it, then to the one before 
that and so on, one eventually reaches 5 , and reversing the process gives a shortest st 
path. For convenience, the predecessor of each new vertex Sk is recorded at the step 
when Sk is selected. 

Sample Problem 7.13. Figure 7.15(a) shows a graph with weighted edges. Find 
the minimum weight path from s to t. 

Solution. Initially set so = s, So = {s} and l(s) = 0. s has no predecessor. 

The nearest vertex to s is a ( w(s , a) = 5, w(s , c) = 6). So s 1 = a, S\ = {s, a} 
and 1 (a) = 5. a has predecessor 
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sab 



Fig. 7.15. Find the path of minimum weight from 5 to t 


We consider each member of Si. The nearest vertex in V\S\ to s (in fact, the 
only one) is c, and w(s, c) = 6 . The nearest vertex to a is b (w(a, b) = 2, 
w(a, c) = 4, w(a,d) = 3). The candidate values are 1(c) = 6 (through s) and 
1(b) = 7 (through a). The smaller is chosen. So ^2 = c, S 2 = {s, a, c} and 
1 (c) = 6 . c has predecessor s . 

We now process 52 . There is no vertex in V\S\ adjacent to s, so s can be ignored 
in this and later iterations. The nearest vertex to a is b\ w(a,b) = 2 so 1(a) + 
w(a, b) = 7. The nearest vertex to c is d\ w(c, d) = 4 so 1(c) + w(c, d) = 10. 
So S 3 = b, 53 = {s, a, c, b} and 1(b) =7. b has predecessor a. 

The nearest vertex to a is d\ w(a, d) = 3 so 1(a) + w(a, d) = 8. The nearest 
vertex to c is d; w(c, d) = 4 so 1(c) + w(c,d) = 10. The nearest vertex to b 
is e\ w(b , e) = 5 so 1(b) + w(b, e) = 12. So S 4 = d, S 4 = {s, a, c , b, d] and 
1 (d) = 8 . d has predecessor a. 

a need not be considered as all its neighbors are in 54 . The nearest vertex to 
c is /; w(c, f) = 5 so 1(c) + w(c, f) = 11. The nearest vertex to b is e\ 
w(b, e) = 5 so 1(b) + w(b, e) = 12. The nearest vertex to d is e\ w(d, e) = 2 
so 1(d) + w(d, e) = 10. So ^5 = e, S 5 = {s, a, c, b, d, ej and 1(e) = 10. e has 
predecessor d. 

From now on b need not be considered. The nearest vertex to c is /; w(c, f) 
is 5 so 1(c) + w(c, f) = 11. The nearest vertex to d is g; w(d, g) = 3 so 
l(d)+w(d, g) = 11. The nearest vertex to e is h\ w(e, h) = 2 so l(e)+w(e, h) = 
12. Either f or g can be chosen. For convenience, suppose the earlier member 
of the alphabet is always chosen when equal ^-values occur. Then = f, Se = 
{s, a, c, b, d, e, /} and 1 (f) = 11 . / has predecessor c. 

Now c can be ignored. The nearest vertex to d is g; w(d, g) = 3 so 1(d) + 
w(d, g) = 11. The nearest vertex to e is h\ w(e, h) = 2 so 1(e) + w(e, h) = 12. 
The nearest vertex to / is g; w(f, g) = 2 so 1(f) + w(f, g) = 13. So ^7 = g, 
g 7 = {s, a, c, b, d, e, /, g} and t(g) = 11 . g has predecessor d. 
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The nearest vertex to d is h\ w(d, g) = 4 so t(d) + w{d, g) = 12. The nearest 
vertex to e is h\ w(e, h) = 2 so 1(e) + w(e, h) = 12. The nearest vertex to 
/ is i; w(f, i ) = 4 so 1(f) + w(f, i) = 15. The nearest vertex to g is h\ 
w(g, h) = 3 so t(g) + w(g, h ) = 14. So s% = h, S% = {s, a, c, b, d, e, /, g, h} 
and 1(h) = 12. We shall say h has predecessor d (e can also be used, but d is 
earlier in alphabetical order). 

d and e are now eliminated. The nearest vertex to / is z; w(f,i) = 4 so 1(f) + 
w(f, i) = 15. The nearest vertex to ^ is j; w(g, j) = 6 so 1(g) + w(g, j) = 17. 
The nearest vertex to h is t; w(h, t) = 3 so 1(h) + w(h,t) = 15. We always 
choose t when it has the equal-smallest £-value. So ^9 = t , and the algorithm 
stops with i(t) = 15. Since t has predecessor h , the minimum weight path is 
sadht , with weight 15. In Figure 7.15(b), all links to predecessors are empha¬ 
sized. It is easy to read off the minimum weight path from this figure, as well as 
the minimum weight paths from s to a,b, c, d, e, f, g, and h. 

Exercises 7.4 

1. In Exercise 7.3.1 you found all paths from s to t in the graph shown in a given 
graph. What is the distance from s to t in each path? 

2. Suppose the graph G has diameter greater than 3. Show that its complement, G, 
has diameter less than 3. 

In Exercises 3 to 16, what are the diameter and radius of the graph? 

3. The complete graph K v . 

4. The n- vertex path P n . 

5. The complete bipartite graph K m , n . 

6. The ladder L n , a 2^-vertex graph constructed by taking two copies of P n and 
joining corresponding vertices. 


7 . 


8 . 
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17 . The square G 2 of a graph G has the same vertex set as G; x is adjacent to y in 
G 2 if and only if D(x, y) = 1 or 2 in G. What are the squares of K\, n , P 5 and 
C 5 ? 

18 . Refer to the preceding exercise. What are the squares of ^3 4 , and the Pe¬ 
tersen graph? 

In Exercises 19 to 24, find the minimum weight paths from s to t in the weighted 
graph shown. 
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7.5 Trees 

We now discuss a special type of graph called a tree. We start with an example that 
shows why these graphs are useful. 

Sample Problem 7.14. Suppose an oil field contains five oil wells w\, W 2 , u> 3 , 
w 4 , W 5 , and a depot d. It is required to build pipelines so that oil can be pumped 
from the wells to the depot. Oil can be relayed from one well to another, at very 
small cost. The only major expense is building the pipelines. Figure 7.16(a) shows 
which pipelines are feasible to build, represented as a graph in the obvious way, 
with the cost {in hundreds of thousands of dollars) shown {a missing edge might 
mean that the cost would be very high). Which pipelines should be built? 
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d d d 





Solution. Your first impulse might be to connect d to each well directly, as 
shown in Figure 7.16(b). This would cost $2,600,000. However, the cheapest 
solution is shown in Figure 7.16(c), and costs $1,600,000. 

Once the pipelines in case (c) have been built, there is no reason to build a direct 
connection from d to W 3 . Any oil being sent from 1 U 3 to the depot can be relayed 
through w 1 . Similarly, there would be no point in building a pipeline joining W 3 to 
W 4 . The conditions of the problem imply that the graph does not need to contain any 
cycle. However, it must be connected. 

A connected graph that contains no cycle is called a tree. So the solution of the 
problem is to find a subgraph of the underlying graph that is a tree, and among those 
to find the one that is cheapest to build. We shall return to this topic shortly. But first 
we look at trees as graphs. 

We have already seen trees. The tree diagrams used in Chapter 6 are just trees 
laid out in a certain way. To form a tree diagram from a tree, select any vertex as a 
starting point (called the root). Draw edges to all of its neighbors. These are the first 
generation. Then treat each neighbor as if it were the starting point; draw edges to 
all its neighbors except the start. Continue in this way; to each vertex, attach all of 
its neighbors except for those that have already appeared in the diagram. 

Figure 7.17 contains three examples of trees. It is also clear that every path is a 
tree, and the star K\, n is a tree for every n. 

Sample Problem 7.15. Draw the first tree in Figure 1 .17 as a tree diagram. How 
many different-looking diagrams can he made? 





7.5 Trees 


245 


Solution. There are four different-looking diagrams. In the following illustra¬ 
tion, the vertices of the original tree are labeled 1, 2, 3, 4. The diagrams obtained 
using each type of vertex as root are shown. 


1 

2 

3 

4 



Practice Exercise. Repeat this problem for the other two trees in Figure 7.17. 

A tree is a minimal connected graph in the following sense: if any vertex of 
degree at least 2, or any edge, is deleted, then the resulting graph is not connected. 
In fact, it is easy to prove that a connected graph is a tree if and only if every edge is 
a bridge. 

Trees can also be characterized among connected graphs by their number of 
edges. 

Theorem 48. A finite connected graph G with v vertices is a tree if and only if it has 
exactly v — 1 edges. 

Proof, (i) Suppose that G is a tree with v vertices. We proceed by induction on v. 
The theorem is true for v = 1 since the only graph with one vertex is K\, which is 
a tree. Suppose it is true for w < v , and suppose G is a tree with v vertices. Select 
an edge (G must have an edge, or it will be the unconnected graph K v ) and delete it. 
The result is a union of two disjoint components, each of which is a tree with fewer 
than v vertices; say the first component has v\ vertices and the second has t> 2 , where 
v\ V 2 = v. By the induction hypothesis, these graphs have v\ — \ and V 2 — 1 edges, 
respectively. Adding one edge for the one that was deleted, we find that the number 
of edges in G is 

(ui - 1) + (i>2 — l) + l = v — 1. 


(ii) Conversely, suppose G is not a tree. Select an edge that is not a bridge and 
delete it. If the resulting graph is not a tree, repeat the process. Eventually there will 
be only bridges left and the graph is a tree. From what we have just said it must have 
v — 1 edges, and the original graph had more than v — 1 edges. □ 

Corollary 2. Every tree other than K\ has at least two vertices of degree 1. 
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Proof. Suppose the tree has v vertices. It then has v — 1 edges. So, by Theorem 43, 
the sum of all degrees of the vertices is 2 (v — 1). There can be no vertex of degree 
0 since the tree is connected and is not K\ \ if v — 1 of the vertices have degree at 
least 2 , then the sum of the degrees is at least 1 + 2 (v — 1 ), which is impossible. □ 

Sample Problem 7.16. Find all trees with five vertices. 

Solution. If a tree has five vertices, then the largest possible degree is 4. More¬ 
over there are four edges (by Theorem 48), so from Theorem 43 the sum of the 
five degrees is 8 . As there are no vertices of degree 0 and at least two vertices of 
degree 2 , the list of degrees must be one of 


4, 1,1, 1,1, 3,2, 1,1,1, 2,2,2, 1,1. 


In the first case, the only solution is the star K\, 4 . If there is one vertex of degree3, 
no two of its neighbors can be adjacent (this would form a cycle), so the fourth 
edge must join one of those three neighbors to the fifth vertex. The only case with 
the third degree list is the path P 5 . These three cases are as follows: 



Practice Exercise. Find all trees with four vertices. 

Spanning Trees; The Minimal Spanning Tree 

A spanning subgraph of a graph G is one that contains every vertex of G. A spanning 
tree in a graph is a spanning subgraph that is a tree when considered as a graph in 
its own right. For example, it was essential that the tree used to plan the oil pipelines 
should be a spanning tree. 

It is clear that any connected graph G has a spanning tree. If G is a tree, then 
the whole of G is itself the spanning tree. Otherwise G must contain a cycle; delete 
one edge from the cycle. The resulting graph is still a connected subgraph of G; and, 
as no vertex has been deleted, it is a spanning subgraph. Find a cycle in this new 
graph and delete it; repeat the process. Eventually the remaining graph will contain 
no cycle, so it is a tree. So when the process stops, we have found a spanning tree. 
A given graph might have many different spanning trees. 

In many applications—such as the oil pipeline problem—each edge of a graph 
has a weight associated with it, such as distance or cost. It is sometimes desirable to 
find a spanning tree such that the weight of the tree—the total of the weights of its 
edges—is minimum. Such a tree is called a minimal spanning tree. 

A finite graph can contain only finitely many spanning trees, so it is possible in 
theory to list all spanning trees and their weights, and to find a minimal spanning 
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tree by choosing one with minimum weight. This process could take a very long 
time, however, since the number of spanning trees in a graph can be very large. So 
efficient algorithms that find a minimal spanning tree are useful. We present here an 
example due to Prim. 

We assume that G is a graph with vertex set V and edge set E , and suppose there 
is associated with G a map w : E —> R called the weight of the edge; when xy is an 
edge of G we write w(x, y) for the image of xy under w. We can quite easily modify 
the algorithm to allow for multiple edges, but the notation is slightly simpler in the 
graph case. The algorithm consists of finding a sequence of vertices xo, x\, X 2 ,..., 
of G and a sequence of sets So, S\, S 2 , ..., where 

Si = {x 0 ,xi, ...,x,-_i}. 

We choose xo at random from V. When n > 0, we find x n inductively using S n as 
follows. 

1. Given /, 0 < i < n — 1, choose y; to be a member of V \ S n such that w(xi , y{) 
is minimum, if possible. In other words: 

(a) if there is no member of V \S n adjacent to x;, then there is no yi ; 

(b) if V \ S n contains a vertex adjacent to x*, then y z - is one of those vertices 
adjacent to x;, and if Xj ~ y, then w(xi, yi) < w(xi , y). 

2. Provided that at least one yi has been found in step (1), define x n to be a y,- such 
that w(xi , yi) is minimal, in other words, x n is the y* that satisfies 

w(xi,yi) < w(xj,yj) for all j. 

3. Put^+i = S n G{x n }. 


This process stops only when there is no new vertex y/. If there is no new ver¬ 
tex y^ it must be true that no member of V \ S n is adjacent to a vertex of S n . It is 
impossible to partition the vertices of a connected graph into two nonempty sets such 
that no edge joins one set to the other, and S n is never empty, so the process stops 
only when S n = V. 

When we reach this stage, so that S n = V, we construct a graph T as follows: 

(i) T has vertex set V ; 

(ii) if Xk arose as y,-, then x/ is adjacent to x& in T ; 

(iii) no edges of T exist other than those that may be found using (ii). 

It is not hard to verify that T is a tree and that it is minimal. 

Observe that X n may not be defined uniquely at step (2) of the algorithm, and 
indeed >7 may not be uniquely defined. This is to be expected: after all, there may be 
more than one minimal spanning tree. 
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Fig. 7.18. An example of Prim’s algorithm 


Sample Problem 7.17. Find a minimal spanning tree in the graph G shown in 
Figure 7.18(a). 

Solution. Select xo = a. Then S\ = {a}. Now yo = b, and this is the only choice 
for x\ . So S 2 = {a, b}. The tree will contain edge ab. 

Working from 52, we get yo = d and y\ = e. Since be (= x\y\) has smaller 
weight than ad (= xoyo), we select X 2 = e. Then S 3 = { a , b, ej and edge be 
goes into the tree. Similarly, from S3, we get X3 = d and 54 = {a, b, d, ej, and 
the new edge is de. 

Now there is a choice. Working from S 4 , y\ — c and y 2 = /. In both cases the 
weight of the edge to be considered is 6. So either may be used. Let us choose c 
and use edge be. 

The final vertex is /, and the edge is cf. So the tree has edges ab, be, be, cf, de, 
and weight 16. It is shown in Figure 7.18(b). 

Practice Exercise. Find a minimal spanning tree in the graph G shown in Fig¬ 
ure 7.18(c). 

The algorithm might be described as follows. First, choose a vertex * 0 . Trivially 
the minimum weight tree with vertex-set {vo}—the only tree with vertex-set {vo}—is 
the K\ with vertex jto. Call this the champion. Then find the smallest weight tree 
with two vertices, one of which is xo', in other words, find the minimum weight tree 
that can be formed by adding just one edge to the current champion. This tree is the 
new champion. Continue in this way: each time a champion is found, look for the 
cheapest tree that can be formed by adding one edge to it. One can consider each 
new tree to be an approximation to the final minimal spanning tree, with successive 
approximations having more and more edges. 

Prim’s algorithm was a refinement of an earlier algorithm due to Kruskal. In 
that algorithm, one starts by listing all edges in order of increasing weight. The first 
approximation is the K 2 consisting of the edge of least weight. The second approx¬ 
imation is formed by appending the next edge in the ordering. At each stage the 
next approximation is formed by adding on the smallest edge that has not been used, 
provided only that it does not form a cycle with the edges already chosen. In this 
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case the successive approximations are not necessarily connected until the last one. 
The advantage of Prim’s algorithm is that, in large graphs, the initial sorting stage of 
Kruskal’s algorithm can be very time consuming. 

Exercises 7.5 

1. Find all different trees on six vertices. 

2. For each n = 2, 3, 4, 5, 6, find a tree on seven vertices with n vertices of de¬ 
gree 1. 

3. Prove that every bridge in a connected graph lies on every spanning tree of the 
graph. 

In Exercises 4 to 10, find the number of spanning trees in the graph, and sketch all 
the trees. 

4. 5. 



In Exercises 11 to 16, find all different ways to draw the tree as a tree diagram. 
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17. (i) On graph A below, a weight function is shown. Find a minimal spanning 

tree in A. 




A B 

(ii) B is similar, except that no weight is specified for one edge. Find a minimal 
spanning tree in B if that edge has weight 

(a) 1, (b) 4, (c) 7. 

In Exercises 18 to 25, find minimal spanning trees in the graph, using both KruskaVs 
and Prim } s methods. 
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Fig. 7.19. Sample graphs for discussing Hamiltonian cycles 


7.6 Hamiltonian Cycles 

Modeling Road Systems Again 

When we discussed modeling road systems in Section 7.2, the emphasis was on visit¬ 
ing all the roads: crossing the Konigsberg bridges, or inspecting a highway. Another 
viewpoint is that of the traveling salesman or tourist who wants to visit the towns. 
The salesman travels from one town to another, trying not to pass through any town 
twice on his trip. In terms of the underlying graph, the salesman plans to follow a 
cycle. 

A cycle that passes through every vertex in a graph is called a Hamiltonian cycle 
and a graph with such a cycle is called Hamiltonian. The idea of such a spanning 
cycle was simultaneously developed by Hamilton in 1859 in a special case, and more 
generally by Kirkman in 1856. 

A Hamiltonian path is a path that contains every vertex. If you take a Hamiltonian 
cycle and delete an edge you obtain a Hamiltonian path, but the reverse is not always 
true. Some graphs contain a Hamiltonian path but no Hamiltonian cycle; an example 
is the Petersen graph. 

Sample Problem 7.18. Consider the graph in Figure 7.19(a). Which of the fol¬ 
lowing are Hamiltonian cycles ? 

(i) (i cl , b , e, d , c, /, a). 

(ii) (a, b , e, c , d , e, /, a). 

(iii) ( a , b, c, d , e, /, a). 

(iv) ( a , b, c, e, f, a). 

Solution, (i) and (iii) are Hamiltonian, (ii) is not; it contains a repeat of e. (iv) is 

not; vertex d is omitted. 

Practice Exercise. Repeat this problem for the graph in Figure 7.19(b) and the 

following cycles. 

(i) ( a , b, c, /, e, d, a). 

(ii) ( a , b, /, c, b , e , d, a). 





(iii) ( a , b, c, d, e, /, a). 

(iv) ( a , d , e, b, c, /, a). 


7.6 Hamiltonian Cycles 253 


Which Graphs are Hamiltonian? 

At first, the problem of deciding whether a graph is Hamiltonian sounds similar to 
the problem of Euler circuits. However, the two problems are strikingly different in 
one regard. We found a very easy test for the Eulerian property, but no nice necessary 
and sufficient conditions are known for the existence of Hamiltonian cycles. 

It is easy to see that the complete graphs with three or more vertices are Hamil¬ 
tonian, and any ordering of the vertices gives a Hamiltonian cycle. On the other 
hand, no tree is Hamiltonian because they contain no cycles at all. We can discuss 
Hamiltonicity in a number of other particular cases, and there are a number of small 
theorems. 

One useful, sufficient condition for a graph to be Hamiltonian is the following, 
due to Ore. We include the proof, but most students should skip it on a first reading. 

Theorem 49. If G is a graph with v vertices , v > 3, and d(x ) + d{y) > v whenever 
x and y are nonadjacent vertices of G, then G is Hamiltonian. 

Proof. Suppose the theorem is false. There must be at least one counterexample— 
a graph that satisfies the conditions, but is not Hamiltonian. Suppose there is a coun¬ 
terexample with v vertices for some particular v. There may be more than one coun¬ 
terexample with v vertices; if there are, select one that has the largest possible num¬ 
ber of edges among counterexamples and call it G. Choose two nonadjacent vertices 
p and q of G. G + pq must be Hamiltonian because it has more edges than G. 
Moreover, pq must be an edge in every Hamiltonian cycle of G + pq because any 
Hamiltonian cycle that does not contain pq will be Hamiltonian in G. Since G sat¬ 
isfies the conditions, d(p) d(q) > v. 

Consider any Hamiltonian cycle in G + pq. Since it contains edge pq , it will 
look like 

p,x 1,X2, ...,x v -2,q,p. 

If Xi is any vertex adjacent to p , then \ cannot be adjacent to q because if it were, 

then 

P,Xi,X 2 , ...,Xi-i,q,X v - 2 ,X v - 3 , ...,Xi,p, 

will be a Hamiltonian cycle in G. So each of the d(p) vertices adjacent to p in G 
must be preceded in the cycle by vertices not adjacent to q , and none of these vertices 
can be q itself. So there are at least d(p) + 1 vertices in G that are not adjacent to q. 
So there are at least d(q) + d(p) + 1 vertices in G, whence 

d(p) + d(q) < v — 1, 


which is a contradiction. 


□ 
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As a consequence, it follows that if a graph has v vertices (v > 3) and every 
vertex has degree at least then the graph is Hamiltonian. 

On the other hand, suppose a graph G contains a Hamiltonian cycle 

* 1 ,* 2 , • • -,X V ,XU 

Since xi occurs only once in the cycle, only two of the edges touching xi can be in the 
cycle. One can sometimes use this fact to prove that a graph contains no Hamiltonian 
cycle. For example, consider the graph of Figure 7.20. 



Fig. 7.20. A graph with no Hamiltonian cycle 


Suppose the graph contains a Hamiltonian cycle. 

The vertices on the outer circuit are each of degree 3, and only two of the edges 
touching any given vertex can be in a Hamiltonian cycle. Figure 7.21(a) shows as 
heavy lines all the edges touching three of those vertices; of the nine edges, three 
are not in the cycle. Similarly, Figure 7.21(b) shows the 15 edges touching the three 
vertices of degree 5; nine of these are out of the cycle. These sets of edges are disjoint, 
so there are at least 12 edges not in the cycle. (If the sets were not disjoint, but had 
k common elements, only 12 — k edges would definitely be eliminated.) Similarly, 
one of the edges touching the central vertex must be deleted in forming the cycle. So 

13 edges are barred and the Hamiltonian cycle must be chosen from the remaining 

14 edges. Since the graph has 16 vertices, a Hamiltonian cycle in it must contain 16 
edges, which is impossible. 

A similar argument can be used to prove the impossibility of a Hamiltonian path 
in this graph. 

Another test is applicable only to bipartite graphs. As a bipartite graph is a sub¬ 
graph of some K m ^ n , its vertices can be partitioned into two subsets, of sizes m and n , 
such that the graph contains no edge that joins two vertices in the same subset. 

Theorem 50. A bipartite graph with vertex sets of sizes m and n can contain a 
Hamiltonian cycle only if m = n, and can contain a Hamiltonian path only if m 
and n differ by at most 1. 
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Fig. 7.21. Steps in proving there is no Hamiltonian cycle 


Proof. Suppose a bipartite graph G has vertex sets V\ and Vz, and suppose it contains 
a Hamiltonian path 

X\, X2, 

Suppose that x\ belongs to V\ . Then xz must be in Vz, x 3 in V\, and so on. Since the 
path contains every vertex, it follows that 

V\ = {xi,x 3 , ...}, 
y 2 = {x 2 ,X4, ...}. 

If v is even, then V\ and Vz each contain v/2 elements; if v is odd, then \ V\ | = 
(v + l )/2 and \ Vz\ = (v — l)/2. In either case, the difference in orders is at most 1 . 
If G contains a Hamiltonian cycle 

* 1 ,* 2 , • • • > x v,Xi, 

and x\ is in V\, then x v must belong to V 2 ; so | Vi | = | V 2 I = v/2. □ 

It should be realized that neither of these necessary conditions is sufficient; in 
particular, the following bipartite graph has four vertices in each subset (the vertices 
in the two sets are colored black and white, respectively). It contains no Hamiltonian 
cycle; this cannot be proved using the above methods. 



Exercises 7.6 

In Exercises 1 to 6, find a Hamiltonian cycle in the graph shown. 

1 . 2 . 




■6 
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3. 4. 



7. Prove that the following graph contains no Hamiltonian cycle. 



8. Prove that the following graphs contain no Hamiltonian paths. 



9. Prove that the following graph contains no Hamiltonian cycle. 



10. Prove that a spanning subgraph of K mM can have a Hamiltonian cycle only if 
m = n. 

11. Use Theorem 50 to prove that the graph of Figure 7.20 contains no Hamiltonian 
path. 

12. Prove that the Petersen graph contains a Hamiltonian path, but no Hamiltonian 
cycle. 
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Fig. 7.22. Find all Hamiltonian cycles 


7.7 The Traveling Salesman Problem 

Remember our example of a traveling salesman who wishes to visit several cities? If 
the cities are represented as vertices and the possible routes between them as edges, 
then the salesman’s preferred itinerary is a Hamiltonian cycle in the graph. 

In most cases there is a cost associated with every edge. Depending on the sales¬ 
man’s priorities, the cost might be a dollar cost such as airfare, or the number of miles 
to be driven, or the number of hours the trip will take. The cost of the trip is the sum 
of the costs of the edges, and the most desirable itinerary will be the one for which 
this sum of costs is a minimum. The problem of finding this cheapest Hamiltonian 
cycle is called the Traveling Salesman Problem. 

We shall continue to speak in terms of a salesman, but these problems have many 
other applications. They arise in airline and delivery routing and in telephone routing. 
More recently they have been important in manufacturing integrated circuits and 
computer chips and for Internet routing. 

To solve a Traveling Salesman Problem on n vertices, your first instinct might 
be to list all possible arrangements of n vertices and then delete those with two 
consecutive vertices that are not adjacent in the graph. This process can then be 
made more efficient by observing that the n lists a\ci 2 ... a n -\a n , < 22*23 • • -a n a 1 , 

..., a n a\ ... a n - 2 a n -i all represent the same cycle (written with a different start¬ 
ing point), and also a n ... < 23 * 22*21 is the same cycle as * 21^2 . .. a n -\a n (traversed in 
the opposite direction). If there is a vertex of degree 2, then any Hamiltonian cycle 
must contain the two edges touching it. If x has neighbors y and z, then xy and xz 
are in each cycle, and the edge yz, if it exists, is not in any cycle. So it suffices to 
delete x, add an edge yz if there is not one already, find all Hamiltonian cycles in the 
new graph, and delete any that do not contain the edge yz. The Hamiltonian cycles 
in the original are then formed by inserting x between y and z. 

Sample Problem 7.19. Find all Hamiltonian cycles in the graph F of Fig¬ 
ure 7.22. 

Solution. The graph F has two vertices of degree 2, namely a and /. When 
we replace these vertices by edges bd and ce, we obtain the complete graph 
with vertices * 2 , b,c,d (multiple edges can be ignored). This complete graph 
has three Hamiltonian cycles: there are six arrangements starting with b , namely 
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bcde , bced , Z?r/c£, bdec , Z?ecd, and the latter three are just the former three 

written in reverse, bcde yields a cycle that does not contain edges bd and ce. The 
other two give the cycles bcfeda and badcfe, so these are the two Hamiltonian 
cycles in F. 

Practice Exercise. Repeat this problem for graphs G and H of Figure 7.22. 

However, this technique does not reduce the problem enough to make it manage¬ 
able because of the large number of Hamiltonian cycles in complete graphs. 

Theorem 51. The complete graph K n contains ( n — l)!/2 Hamiltonian cycles. 

Proof. There are n\ arrangements of the vertices. Each Hamiltonian cycle gives rise 
to n arrangements in the same cyclic order, and a further n are obtained by reversing 
them. So there are n\/(2n) = (n — 1)1/2 Hamiltonian cycles. □ 

This number grows very quickly. For n = 3, 4, 5, 6, 7 the value of (n — l)!/2 
is 1, 3, 12, 60, 360; in K\o, there are 181,440, and in K 24 , there are about 10 23 
Hamiltonian cycles. Twenty-four vertices are not an unreasonably large network, 
but performing so many summations and comparing them would be impossible in 
practice. To give you some idea of the times involved, if you had a computer capable 
of evaluating and sorting through a million ten-vertex cycles per second, a complete 
search solution of the Traveling Salesman Problem for K 10 would take about 0.18 
seconds. Assuming the computer took about twice as much time to process a 24- 
vertex cycle, the complete search for K 24 would take about a billion years. 

Some fast methods of solution have been developed for sparse graphs—graphs 
where the average degree is low. This is a reasonable assumption in some cases, but 
very often the graph is complete or nearly so. In that case there is no known algorithm 
for solving the Traveling Salesman Problem that is substantially better than listing 
all Hamiltonian cycles. However, fast methods of reaching a “good solution” have 
been developed. Although they are not guaranteed to give the optimal answer, these 
approximation algorithms often give a route that is significantly cheaper than the 
average. 


Approximation Algorithms 

The nearest-neighbor method works as follows. Starting at some vertex x, one first 
chooses the edge incident with v whose cost is least. Say that edge is xy. Then an 
edge incident with y is chosen in accordance with the following rule: if y is the vertex 
most recently reached, then eliminate from consideration all edges incident with y 
that lead to vertices that have already been chosen (including jc), and then select 
an edge of minimum cost from among those remaining. This rule is followed until 
every vertex has been chosen. The cycle is completed by going from the last vertex 
chosen back to the starting position x. This algorithm produces a directed cycle in 
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Fig. 7.23. Traveling Salesman Problem example 


the complete graph, but not necessarily the cheapest one, and different solutions may 
come from different choices of initial vertex x. 

The sorted edges method does not depend on the choice of an initial vertex. One 
first produces a list of all the edges in ascending order of cost. At each stage, the 
cheapest edge is chosen with the restriction that no vertex can have degree 3 among 
the chosen edges, and the collection of edges contains no cycle of length less than v, 
the number of vertices in the graph. This method always produces an undirected 
cycle, and it can be traversed in either direction. 

Sample Problem 7.20. Suppose the costs of travel between St. Louis, Nashville, 
Evansville, and Memphis are as shown in dollars on the left in Figure 7.23. You 
wish to visit all four cities, returning to your starting point. What routes do the 
two algorithms recommend? 

Solution. The nearest-neighbor algorithm, applied starting from Evansville, 
starts by selecting the edge EM , because it has the least cost of the three edges 
incident with E. The next edge must have M as an endpoint, and ME is not al¬ 
lowed (one cannot return to E, it has already been used), so the cheaper of the 
remaining edges is chosen, namely MN. The cheapest edge originating at N is 
NE, with cost $110, but inclusion of this edge would lead back to E, a vertex that 
has already been visited, so N E is not allowed, and similarly NM is not avail¬ 
able. It follows that NS must be chosen. So the algorithm finds route EMNSE, 
with cost $520. 

A different result is achieved if one starts at Nashville. Then the first edge se¬ 
lected is NE, with cost $110. The next choice is EM, then MS, then SN, and 
the resulting cycle NEMSN costs $530. 

If you start at St. Louis, the first stop will be Evansville ($120 is the cheap¬ 
est flight from St. Louis), then Memphis, then Nashville, the same cycle as the 
Evansville case (with a different starting point), costing $520. From Memphis, 
the cheapest leg is to Evansville, then Nashville, and finally St. Louis, for $530— 
the same cycle as from Nashville, in the opposite direction. 

To apply the sorted edges algorithm, first sort the edges in order of increas¬ 
ing cost: £M($100), EN($110), ES($120), MN($130), MS ($150), NS($ 170). 
Edge EM is included, and so is EN. The next choice would be ES, but this is 
not allowed because its inclusion would give degree 3 to E. MN would complete 
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a cycle of length 3 (too short), so the only other choices are MS and NS, forming 
route EMSNE (or ENSME) at a cost of $530. 

However, in this example, the best route is ENMSE, with cost $510, and it does 
not arise from the nearest-neighbor algorithm, no matter which starting vertex is 
used, or from the sorted edges algorithm. 

Practice Exercise. A new cut-rate airline offers the fares shown on the right side 
of Figure 7.23. What do the algorithms say now? 

In the real world, the cost of an itinerary will not necessarily be the sum of the 
individual legs. For example, airlines often offer fares for a flight with a stopover 
that are cheaper than the sum of the fares for the individual legs. However, the ad¬ 
ditive model applies in the majority of applications. In any case, the considerations 
discussed in this section still apply in the more general case—even if the fare is not 
additive, the time needed to check all Hamiltonian cycles is still prohibitive. 

Exercises 7.7 

In Exercises 1 to 6, list all Hamiltonian cycles in the graph shown. 

1 . 2 . 


be a b 
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5. 


6 . 


a b c 




f g 


7. Solve the Traveling Salesman Problem for the graphs G and H in Figure 7.22 
by finding all their Hamiltonian cycles, if the costs associated with the edges are 
as follows: 


For G: 

ab = 18, ad = 12, ae = 30, 
be = 24, = 14, bf = 28. 

cd = 30, cf = 16, 
dc = 21, ef = 25, 


For H: 

aZ? = 19, ad = 16, be = 22, 
bd = 17, be = 21, bf = 25. 
cc = 24, c/ = 19, 
dc = 18, ef = 21, 


8 . Consider a complete graph K$ with vertices abede, the costs associated with the 
edges being as follows: 

ab = 1, ac = 2, ad = 3, ae = 4, be = 7, 
bd = 8, be = 9, cd = 5, ce = 6, de = 10. 

Find the cost of the cheapest Hamiltonian cycle in this graph by a complete 
search. Then find the costs of the routes generated by the nearest-neighbor al¬ 
gorithm starting at each of the five vertices in turn and show that the nearest- 
neighbor solution is never the cheapest. 

In Exercises 9 to 18, you are given the costs associated with the edges of a complete 
graph with vertex-set [a, b, c, d, ej. Find the costs of the routes generated by the 
nearest-neighbor algorithm starting at each of the five vertices in turn and by the 
sorted edges algorithm. 

9. ab = 24, ac = 23, ad = 20, ae = 21, be = 27. 

bd = 22, be = 30, cd = 26, ce = 27, de = 28. 

10. ab = 24, ac = 26, ad = 20, ae = 21, be = 33. 

bd = 29, be = 30, cd = 25, ce = 27, de = 28. 

11. ab = 24, ac = 22, ad = 30, ae = 29, be = 17. 

bd = 19, be = 30, cd = 18, ce = 21, de = 25. 

12. ab = 59, ac = 69, ad = 60, ae = 58, be = 56. 

bd = 69, be = 54, cd = 58, ce = 66, de = 61. 

13. ab = 12, ac = 18, ad = 14, ae = 19, be = 22. 

bd = 16, be = 15, cd = 18, ce = 17, de = 11. 
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14. ab = 16, ac = 24, ad = 30, ae = 48, Z?c = 27. 

bd = 29, Z?e = 44, cd = 16, ce = 46, de = 51. 

15. ab = 27, ac = 22, ad = 28, ae = 17, Z?c = 33. 

Z?d = 14, Z?e = 23, cd = 24, ce = 26, de = 19. 

16. ab = 91, ac = 79, ad = 75, ae = 82, 7>c = 87. 

bd = 64, Z?e = 78, cd = 68, ce = 81, de = 88. 

17. ab = 45, ac = 28, ad = 50, ae = 36, 7>c = 21. 

fed = 42, be = 34, cd = 44, ce = 39, de = 25. 

18. aZ? = 11, ac = 15, ad = 13, ae = 18, be = 20. 

bd = 12, Z?e = 16, cd = 14, ce = 19, de = 17. 


8 


Matrices 


A matrix is a two-dimensional array of numbers. Sometimes it is natural to classify 
data in two ways—for example, you might classify your music collection by media 
(MP3 or CD) or by type (folk, heavy metal, ...); a realtor might classify her listings 
by area or by price. Matrices are useful for representing this type of data. Matrices 
arise in many parts of mathematics; usually they are studied as part of linear algebra. 

There are many applications of matrices in discrete mathematics. They are par¬ 
ticularly important in the study of relations and graphs, so we have included a section 
that briefly introduces those applications. Many other properties of graphs and rela¬ 
tions can be deduced using further algebraic properties of matrices. 


8.1 Vectors and Matrices 

Vectors 

A vector is an ordered finite set of numbers. We say real vector if all the elements 
are real numbers; we can also talk about integer vectors , and so on. If it is required 
that all elements belong to some number set, those numbers are called the scalars for 
those vectors. 

The number of elements in a vector is called its dimension or length. The set of all 
real vectors of dimension n will be denoted W 1 . The set M 2 is familiar from geometry 
because the coordinates of points in the plane are just members of M 2 . Similarly, M 3 
is the set of coordinates of points in three-dimensional space. 

When we talk about the elements of a vector and specify their positions we usu¬ 
ally call them entries. The usual notation for a vector is to denote its ith entry by 
subscript i. The vector v has entries v\, V 2 , ..., and we write v = (iq, V 2 , .. .)• 
Two vectors are equal if and only if they are equal in every position (< componentwise 
equal): u = v if and only if Ui = i h for every i. It follows that equal vectors must be 
of the same dimension. 


W.D. Wallis, A Beginner's Guide to Discrete Mathematics , 
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In physics or in geometry we encounter vectors as directed line segments. If 
you prefer to think of them in this way, then the vector (1,2, 1) corresponds to the 
directed line from the origin to point (1, 2, 1) in three-dimensional space. 

There are two standard operations on vectors: one may multiply by a scalar , and 
one may add vectors. They are defined as follows. 

If v = (i>i, V 2 , • • •, v n ), and k is any scalar, then kv = (kv i, kv 2 , ..., kv n ). If 
u = (u 1 , U 2 , ..., u n ), and v = (v\, V 2 , ..., v n ), then 

u + V = ((Ml + Vi), (M 2 + U 2 ), . . . , (u n + n n )). 

If u and v are vectors of different lengths, then m + r is not defined. 

It is easy to see that vector addition satisfies the commutative and associative 
laws: if u and v are any two vectors of the same length, then 

u + V = V + M, 

and any three vectors t,u , and r of equal length satisfy 

t T - (w T - r) = (£ T - 11 ) v. 

We usually omit brackets when writing the sum of three or more vectors, so this 
common value is simply denoted t + u + v. 

There are also two laws involving multiplication by a scalar. If a and b are any 
numbers and u and v are any two equal-length vectors, then 

a(bu) = ( ab)u , 


and 

a(u + v) = ( au ) + (av). 

Again we follow the same convention as ordinary arithmetic and omit brackets, writ¬ 
ing the last expression as simply au + av. 

Sample Problem 8.1. Calculate 3(1, —1, 3) and (2, 2) + (— 1, 3). 

Solution. 3(1, -1, 3) = (3, -3, 9); (2, 2) + (-1, 3) = (1, 5). 

Practice Exercise. Calculate 4(2, 0, —1) + (1, 4, —3). 


Subtraction is defined in the natural way: u — v = ((u\ — v\), (02 — V 2 ),, 
( u n — v n )). If we write — v = (— v\, —V2, • • •, — v n ), then u — v = u + (— v), and 
moreover —v = (— l)v. 

It is natural to define a zero vector 0 = (0, 0, ..., 0) (in fact, a family of zero 
vectors, one for each possible dimension). Then v + (— v) = 0, as one would expect. 
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Lines; the Scalar Product 

The equation of a straight line in coordinate geometry can be written in the form 


ax + by = c 


where a, b, and c are numbers and x, y are the usual variables. The equation involves 
two vectors, the vector (a, b) of coefficients and the vector (x, y) of variables. For 
this reason it is natural to associate ax + by with the two vectors (a, b) and (x, y). 

We define the scalar product of two vectors u = (u\, U 2 , ..., u n ) and v = 
(v\, V 2 , • • •, v n ) to be 


n 



k= 1 


This is also called the dot product of the two vectors. In this notation a typical straight 
line in two-dimensional geometry has an equation of the form 


a x = c, 


where a is some vector of two real numbers, x is the vector of variables (x, y), and 
c is a constant. If a and x are of length three, then a x = c can be the equation of a 
plane in three-dimensional space. 

Sample Problem 8.2. Suppose t = (1, 2, 3), u = (—1,3,0), and v = (2, —2, 2). 

Calculate u v, (t — u) v, and 3(v • t). 

Solution, u v = — 2 — 6 + 0 = —8; (t — u) - v = (2, -1,3)- (2, —2, 2) = 4 + 

2 + 6 = 12; 3(v • t) = 3 x 8 = 24. 

Practice Exercise. Calculate u • t and (2 u — 3 v) t. 

Sample Problem 8.3. Suppose (x, 3) = (4, x + y). What are x and y? 

Solution. Two vectors are equal if and only if their components are equal. So 

we have the two equations x = 4 and 3 = x + y. So x = 4 and y = — 1. 

Practice Exercise. Suppose (2x, 2) = (y, x). What are x and y? 

It is not hard to see that the scalar product is commutative. There is no need to 
discuss the associative law because scalar products involving three vectors are not 
defined. For example, consider t (u v). Since ( u • v) is a scalar, not a vector, we 
cannot calculate its dot product with anything. 
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Matrices 

Just as one linear equation can be written as one vector equation, a set of linear 
equations can be written as a set of vector equations. For example, the equations 

2x + y = 3, 

3x — 2y = 1, 

can be written as 


((2 , 1) • (jc, y), (3,-2) • (x, y)) = (3, 1), 
or, a little easier to read, 


(2, 1) • (x,y)‘ 


"3" 

_ (3, —2) ■ (x,y) _ 


1 


This can be seen as the two-dimensional array of numbers 

"2 1 " 

3 -2 ’ 


acting on the vector (x, y) to produce the vector (3, 1). We shall call this array a 
matrix. We usually denote matrices by capital letters, to distinguish them from scalars 
(numbers). 


Just as a vector generalizes a number, a matrix generalizes a vector. A matrix is 
any rectangular (two-dimensional) array of numbers. A typical matrix is 


M = 


1 

2 

4 


-14 3 

3 -4 1 

-4 4 8 


This example is a 3 x 4 real matrix (and it is, in fact, a 3 x 4 integer matrix). We call 
3 x 4 the shape or size of the matrix, and the two numbers 3 and 4 are its dimensions. 

A vector can be interpreted as a matrix with one of its dimensions 1. A 1 x n 
matrix is called a row vector and an n x 1 matrix is called a column vector. An 
m x n matrix can be viewed as a vertical stack of m row vectors, and those vectors 
are called the rows of the matrix. Alternatively, we can see it as a horizontal array of 
column vectors, called the columns of the matrix. Our example M can be viewed as 


"(i -i 

4 

3)" 


1 

-1 

4 

3 

(2 3 

-4 

1) 

or 

2 

3 

-4 

1 

1 - 

4 ^ 

1 

4 ^ 

4 

8) 


4 

-4 

4 

8 


The element in the i th row and jth column is called the (/, j) element of the matrix. 
It is standard to denote the (/, j) element of a matrix A by atj , and write A = (atj). 
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This is particularly useful when a formula is given for atj . For example, we might 
refer to the 3 x 3 matrix whose (/, j) element equals i + j, namely 

"2 3 4" 

3 4 5, 

4 5 6 


as “the 3x3 matrix (/ + j) ” 


Sums and Products 

The sum of two matrices and the product of a scalar with a matrix are defined 
analogously to the vector case. We again define negatives by —A = (—1 )A. Ad¬ 
dition satisfies the commutative and associative laws, and the scalar product laws 
a(bC ) = (i ab)C and a(C + D) = aC + aD. 


Sample Problem 8.4. Suppose A , B, and C are the matrices 


A = 



3 

2 



0 

4 


Find A 4- B, 2 A — 3 B, 3 A T C, — C. 



Solution. 


A + B = 
-C = 


-1 3 

0 6 J ’ 

1 -4 -3 

1 2 1 


2A - 3 B 


-4 6 

-5 -8 


3 A + C is not defined, as A and C are of different sizes. 


Practice Exercise. Calculate — A,3A — B, B + C. 


We define the product of two matrices as a generalization of the scalar product 
of vectors. Suppose the rows of the matrix A are a \, • • •, and the columns of the 

matrix B are b \, # 2 , • • • • Then A B is the matrix with (/, j) entry at b j . These entries 
will only exist if the number of columns of A equals the number of rows of A, so this 
is a necessary condition for the product AB to exist. 


Sample Problem 8.5. Suppose 




1 

0 


Find AB and BA. 
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Solution. First we find AB. The rows of A are a\ = (1, 2) and a 2 = (1, —1). 
The columns of B are b\ = (—1,2) and b 2 = (1,0). (Since we are treating them 
as vectors, it does not matter whether we write them as row or column vectors.) 
Then a\ b\ — —1+4 = 3, and similarly a\ • #2 = 1, a 2 • b\ = —3 and 
a 2 b 2 = 1. So 

3 r 

-3 1 _ * 

Similarly we find 



The entries in A B will only exist if the number of columns in A equals the num¬ 
ber of rows in B. For example, if A were 3x2 and B were 4x4, the product will 
not exist. In general, we can say the following. 

Theorem 52. Suppose A is an m x n matrix and B is an r x s matrix. If n = r, then 
AB exists and is ann x s matrix. If n r, then AB does not exist. 

If A were 2x3 and B were 3x4, then AB would be a 2 x 4 matrix but BA 
would not exist. It is also possible that AB and BA might both exist, but might be of 
different shapes. For example, if A and B have shapes 2x3 and 3x2, respectively, 
then AB is 2 x 2 and BA is 3 x 3. And we observe from the preceding example that, 
even when AB and BA both exist and are the same shape, they need not be equal. 
There is no commutative law for matrix multiplication. 

We shall look at some more properties of matrix multiplication in the next sec¬ 
tion. 


AB = 


a\ • b\ 
a 2 • b\ 


a\ • b 2 
a 2 • b 2 


Transposition 

If A is an m x n matrix, then we can form an n xm matrix whose (/, j) entry equals 
the (j, i) entry of A. This new matrix is called the transpose of A, and written A T . 
A matrix A is called symmetric if A = A T . 

Sample Problem 8.6. What is the transpose of the matrix A from Sample Prob¬ 
lem 8.5? 

Solution. 



Practice Exercise. What is the transpose of the matrix B from Sample Prob¬ 
lem 8.5? 
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The transposition symbol is applied only to the matrix nearest to it. For example, 
the expression AB T means A(B T ), not (AB) T . 

Exercises 8.1 


1. Verify that real matrices satisfy the commutative and associative laws. 
Carry out the vector computations in Exercises 2 to 23. 


2. 4(2, -2). 

4. 2(5, 1, -1). 

6. (2, 3)+ (1,4). 

8. (1,0,3)+ 3(4, 4, 4). 

10. 3(—1, 2, 3) + 2(1, 1, -1). 

12. 3(1,0, 1,0) -4(2,0, -1, -1). 
14. (2, 3)-(1,-1). 

16. (1, 1, -1) • (2,0,3). 

18. (4,2, 1) • (1,2,4). 

20. (0,2, 1, 1) • (3,4,2, 1). 

22. (3, -1,3,2) • (-1, -1,2, 1). 


3. -(2,-2). 

5. 3(3,6, 1). 

7. 3(2,3) -2(1,4). 

9. 2(—1, -1,2) -2(2, -1, -1). 

11. 3(1,-2, 2)+ 2(2, 3,-1). 

13. 2(4, -1, 2, 3) - 3(1, 6, -2, -3). 
15. (1,-1). (2,3). 

17. (1,3,3) • (1,0, -2). 

19. (1,-1,-2). (3, 2,-1). 

21. (-1,2, -1,3) • (2,4, -3, -1). 
23. (1, -2,5,2) • (2,2,3, 1). 


A is a 2x4 matrix; B is 2 x 4; C is lx 3; D is 4 x 2; E is 3 x 4; E is 4 x 3; G is 
4 x 4. In Exercises 24 to 41, my whether the indicated matrix exists. If it does exist, 
what is its shape ? 


24. A + B 
27. AD. 
30. F t . 
33. DA. 
36. GG. 
39. CEE. 


25. 2A - B. 
28. F>(A + F). 
31. CF r . 

34. AD 4-DA. 
37. FF. 

40. CEE. 


26. CF. 

29. CF. 

32. 2FC. 

35. FFF. 

38. DA + 3G. 
41. AGF. 


42. Suppose < 2 , Z? are any two real numbers and C, D are any two real matrices of 
the same shape. Prove the following equalities. 

(i) a(bC) = (ab)C. 

(ii) a(C + Zl) = aC + aD. 

(iii) (a + b)C = aC + bC. 

In Exercises 43 to 48, carry out the matrix computations. 


" 4 

-1" 

+ 

" 3 

-1" 

-2 

0 _ 

-1 

2 
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43. 

45. 3 


47. 


44. 3 


1 -1 -1 

-2 0 10 


"10 

-1" 

- 2 

1 

-1" 

2 

7 


-1 

1 


46. 


3 -4 
3 0 


2 0 1 
3 -1 1 


1 

" 2 

-1 


" 2 

-1 


1 

-1 

48. 

1 

-1 

- 

-1 

1 


-1 

1 


6 -2 
-7 4 


2 0 1 
3 -1 1 


In Exercises 49 to 64, 


A = 

1 

-2 

-1" 
3 _ 

’ 

B = 

" 3 
-1 

0 

4 

- 1 " 

4 

, c = 

"6" 

2 

’ 


"-1 

1 

- 1 " 


"1 

-1" 




2" 

D = 

1 

3 

3 

E = 

1 

0 


F = 

— 

1 


-2 

2 

0 


2 

2 




2 


G = [2 1 -1], H = [2 2], 


K = [~ 1 2]. 


Carry out the matrix computations , or explain why they are impossible. 


49. 2A. 

50. 

BD. 

51. 

AC. 

52. EF. 

53. BE. 

54. 

B t . 

55. 

2 H - 3 K. 

56. HA + 2K 

57. CF. 

58. 

D - EB. 

59. 

BG. 

60. KB. 

61. GD. 

62. 

CE T . 

63. 

EK + KB. 

64. 2 F + G. 

65. Suppose 


X 

-1 

-1" 

2 

~y +1 -1" 

— 1 X 



What are the values of x and yl 
66. Find x, y, and z so that 


x - 2 3 z 


y z 3 

<N 

_I 


_ 3z ;y + 2 6z _ 


67. A and 5 are any two matrices such that AB exists. Prove that B T A T exists and 
that 

B t A t = ( AB) t . 
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8.2 Properties of the Matrix Product 

Identity Elements 

Suppose A is an r x s matrix. Adding 0 to every entry will not change A. So the r xs 
matrix with every entry 0 will act like a zero element, or additive identity, for A. We 
shall denote this matrix by O r , s and call it a zero matrix. Usually the subscripts can 
be omitted because the shape can be deduced from the context. For any matrix A, the 
appropriate zero matrix satisfies 


A + O = O + A = A and A + (-A) = (-A) + A = O. 

Writing — A for (—1) A is an extension of the notation we used for zero vectors, and 
is consistent with the usual arithmetical notations for zero and negatives. 

The zero matrix also behaves under multiplication the way you would expect: 
provided zero matrices of appropriate size are used, 

OA = O and AO = O. 


This is not just one rule, but an infinite set of rules. If we write in the subscripts, then 
the full statement is as follows. 

If A is any rxs matrix, then O mj A = O m , s for any positive integer m, and AO s , n = 
O rn for any positive integer n. 

There are also multiplicative identity elements. We define I n to be the nxn matrix 
with its (1, 1), (2, 2), ..., (n, n) entries 1 and all other entries 0. For example, 


h 


1 0 0 
0 1 0 
0 0 1 


If A is any rxs matrix, then I r A = A = AI S . 
We call I n an identity matrix of order n. 


Commutativity 

We saw that the commutative law does not hold for matrices in general. Even if A B 
and BA are both defined and are the same size, it is possible for the two products 
to be different (see Sample Problem 8.5). On the other hand, some pairs of matrices 
have the same product in either order. If A B = B A we say that A and B commute , 
or A commutes with B. For example, any 3x3 matrix commutes with A. There are 
many other examples. 
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Sample Problem 8.7. Show that the following matrices commute. 


Solution. 




2 

3 


AB = BA = 


3 

2 



Practice Exercise. Show that the following matrices commute. 


3 1 


-1 1 



1 

0 


Suppose A has shape m x n and B is r x s. If both AB and BA exist, then 
necessarily n = r and m — s\ then AB is m x m and BA is n x n. In order for A 
and B to commute, we must have m — n. Both A and B must have the same number 
of rows as columns. Such a matrix is called square , and the common dimension is 
called its order. 

If A is square, we can evaluate the product A A. We call this A squared , and write 
it as A 2 , just as with powers of numbers. We define other positive integer powers 
similarly: A 3 = AAA = A A 2 , and in general A n+l = A A n . 


Inverses 

If the matrices A and B satisfy AB = BA = /, we say that B is an inverse of A. 

In the real numbers, everything but 0 has an inverse. In the integers, only 1 and 
— 1 have integer inverses, but we know that we can obtain inverses of other nonzero 
integers by going to the rational numbers. The situation is obviously more compli¬ 
cated for matrices because only a square matrix can have an inverse. Moreover, there 
are nonzero square matrices without inverses, even if we restrict our attention to the 
2x2 case. 

Sample Problem 8.8. Show that the matrix 


has no inverse. 

Solution. Suppose A has inverse 

x y 
z t 


B = 
















8.2 Properties of the Matrix Product 273 


Then AB = /, so 


2 r 



2 x + z 

2 y +1 


"1 0" 

2 1 

_z t 


2x + z 

2y +1 


_° 1 


The (1, 1) entries of the two matrices must be equal, so 2 x+z = 1; but the (2, 1) 
entries must also be equal, so 2x + z = 0. This is impossible. 


Practice Exercise. Show that the matrices 

0 1 
0 0 


C 


1 0 
0 0 


and D 


have no inverses. 

A matrix that has an inverse will be called invertible or nonsingular ; a square 
matrix without an inverse is called singular. 

We used the phrase “an inverse” above. However, we shall prove that, if a matrix 
has an inverse, it is unique. 

Theorem 53. If matrices A, B, C satisfy AB = B A = I and AC = C A = I, then 
B = C. 

Proof. Suppose A, B, and C satisfy the given equations. Then 


C = CI = C(AB) = ( CA)B = IB = B, 


so B and C are equal. □ 

In fact, it can be shown that either of the conditions AB = I ox BA = I is 
enough to determine that B is the inverse of A. However, this requires more algebra 
than we shall cover in this book. 


Sample Problem 8.9. Suppose 


What is the inverse of A if it exists? 



1 

1 


Solution. Suppose the inverse is 


C = 


y 


z 

t 


Then AC = / means 




















274 


8 Matrices 


"3 

2" 

X z 


"1 

0" 

4 

3 _ 

_y t_ 


_0 

1 


which is equivalent to the four equations 

3x + 2y = 1, 3z + 2t = 0, 

Ax + 3y = 0, 4z + 3f = l. 

The left-hand pair of equations is easily solved to give x = 3 and y = —4, while 
the right-hand pair gives z = — 2 and £ = 3. So the inverse exists, and is 



Practice Exercise. What is the inverse of B if it exists? 

The above procedure can be used to invert square matrices of any order; if there 
is no inverse, then the equations will have no solution. In the next section we shall 
show how to reduce the number of computations required. 

The usual notation for the inverse of A, if it exists, is A -1 . If we define A 0 = I 
whenever A is square, then the powers of matrices satisfy the usual index laws 

A m A n = A m+n (A m y = A mn 

for all nonnegative integers m and n , and for negative values also provided that A -1 
exists. If x and y are nonzero reals, then ( xy)~ l = x -1 y -1 . The fact that matrices 
do not necessarily commute means that we have to be a little more careful. 

Theorem 54. If A and B are invertible matrices of the same order, then AB is in¬ 
vertible, and 

(.AB) -1 = B~ X A~ X . 

Proof. We need to show that both (B~ x A~ X ){AB) and {AB){B~ X A~ x ) equal the 
identity. But (B~ x A~ X )(AB) = B~ X {A~ X A)B = B~ X IB = B~ x B = I = 
AA" 1 = A/A" 1 = A(BB~ l )A~ x = (AB)(B~ l A~ l ). □ 

There are two cancellation laws for matrix multiplication. If A is an invertible 
r x r matrix and B and C are r x s matrices such that AB = AC, then 

AB = AC =>• A~ l (AB) = A~ X (AC) 

=>• (A"*A)B = (A"*A)C =>■ IB = 1C B = C, 

so B = C. Similarly, if A is an invertible s xs matrix and B and C are r x s matrices 
such that BA = CA, then B = C. 

The requirement that A be invertible is necessary. We can find matrices A, B, 
and C such that AB and AC are the same size, A is nonzero, and AB = AC, but B 
and C are different. One very easy example is 
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"1 0" 

"1 

-1" 


"1 0" 

"1 -1" 

-1 

o 

o 

_i 

2 

3 _ 


o' 

o 

1 4 


Some other examples are given in the exercises. 

Moreover we can only cancel on one side of an equation; we cannot mix the two 
sides. Even if A is invertible it is possible that AB = CA but B ^ C (see Exercises 
8.2.21 and 8.2.22). 

Exercises 8.2 


1. Is it correct to say that, for any matrix A, AO = O = OA1 Why or why not? 

2. Prove the following: if A is an invertible s x s matrix and B and C are r x s 
matrices such that BA = CA, then B — C. 

In Exercises 3 to %,find the products AB and BA. Do the two matrices commute? 


3. A = 

5. A = 

7. A = 

8. A = 


"2 

0 " 

, B = 

"-2 

4' 

_1 

-1 

_ 3 

2_ 

" 3 

r 

, B = 

" 1 

3' 

-1 

3 _ 

_ —3 

1 

" 3 

1 

-2" 



1 

2 

-1 

, B 

= 

-1 

- 

1 3 




4. 


6 . 



2 0 1 
-13 0 
0 1 2 


-1 

1 

2 " 

4 


4 
2 1 


In Exercises 9 to 14, A is given. Find A 2 and A 3 . 


9. 



1 

1 



-1 

0 


11 . 





3 

1 


13. 



0 3 

3 -1 

-1 1 


14. 


1 2 

0 -1 

0 0 



1 

2 


15. Consider the matrix 

, n 3 1 
L 5 3 _' 

(i) Find A 2 and A 3 . 

(ii) Evaluate A 3 —2 A — I. 

(iii) Show that A 2 — 4A — 12/ = O. 
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In Exercises 16 to 19, show that the matrices are inverses. 



"2 

5" 




3 

-5" 


r 3 

-1" 




2 

2 "I 

16. 

_1 


and 


-1 


17. 

2 

and 


3 

3 


3 




2 

_0 

1 



_0 

1 _ 


- l 


r 



"-2 

2" 

19. 

" 2 

_3 




"2 

3 

18. 

2 


and 


1 

-1 



and 

1 


1 

— 

1 



-2 


2 




2 


20. Find a matrix A such that 


"2 

r 


"1 

- 1 " 

3 

2 


1 

4 


In Exercises 21 and 22, show that AB = AC. 


21. 

A = 

"1 

_2 

-1" 
—2_ 

’ 

B = 

"2 

1 

r 

4 _ 

, c = 

i i 

UJ 4^ 

i i 

O CO 






"1 

0 

r 



"1 

1 2 



"2 

2 

r 

22. 

A = 

1 

1 

i 

, B : 


1 

1 0 

, c 

= 

1 

1 

0 



1 

2 

i 



1 

2 2 



0 

1 

3 


In Exercises 23 and 24, show that A 1 exists, but AB = CA, even though B ^ C. 


23. A = 

24. A = 


-1 3 

1 -2 

2 - 1 " 

1 1 


B 


B = 


1 2 
1 1 

3 0 

4 1 


5 7 

-2 -3 


C = 


1 0 
2 3 


25. The (leading) diagonal of a matrix is the set of entries in positions (1, 1), 
(2, 2), (3, 3), .... A diagonal matrix is a square matrix with all of its elements 
zero except those on the diagonal. Prove that any two diagonal matrices of the 
same order commute. 


26. Suppose M x denotes the 2 x 2 matrix 


1 x 
0 1 


where x may be any real number. 

(i) Compute M x M y , and show that the matrices M x and M y commute for any 
real numbers x and y. 

(ii) Find M 2 , M x 3 , and M x 4 . 

(iii) Find a formula for M x n , where n is any positive integer. 

(iv) What is M x -'l 

27. A square matrix A is called idempotent if it satisfies A 2 = A. 
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(i) Which of the following matrices are idempotent? 


1 0 0 


1 -1 0 


1 

Cl 

1 

<N 

0 1 0 

9 

0 0-2 

9 

-13 4 

1 

o 

o 

o 


-2 1 1 


1 -2 -3 


(ii) Prove that, if A is an idempotent matrix, then / — A is idempotent. 


8.3 Systems of Linear Equations 

Matrix Representation of Equations 

The solution set of a system of equations such as 

2x + 4y — 4z = 4, 

-2 y + 4z = 6, 
x — y + 4z = 10, 

is the set of all assignments of values to the variables that make all the equations true. 
For example, the above system has solution x = 3, y = 1, z = 2. We often write 
the solutions as vectors with the variables taken in standard (alphabetical) order, and 
would say the solution set is {(3, 1,2)}. Other systems have infinite solution sets, or 
the set could be empty, in which case we say the system is inconsistent. 

We introduced matrices in Section 8.1 by pointing out their relationship to sets 
of linear equations. The above system of equations can be written as 


"(2,4, —4) • (x, y, z) ” 


" 4 " 

(0, -2,4 )-(x,y,z) 

= 

6 

_(1, -1,4) • (x,y,z)_ 


10 


which can be seen as the matrix 

"2 4 -4" 

0-24, 

_i -1 4 _ 

acting on the vector (x, y, z) to produce the vector (4, 6, 10) 


"2 4 

-4 


X 


" 4 " 

0 -2 

1 -1 

i 

d- d- 


Cl ^ 

l_ 

— 

6 

10 


The typical set of m linear equations in n unknowns can be written 


Ax = b, 
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where A is an m x n matrix of coefficients, x is an n x 1 matrix (column vector) of 
unknowns, and b is the m x 1 matrix (column vector) of the right-hand sides of the 
equations. 

The usual way of solving such a system of equations is to use one equation to 
express one variable in terms of the other variables and substitute for that variable in 
other equations. Repeat this process in the new set of m — 1 equations. Continue until 
only one equation remains. This equation is used to evaluate one variable (either as 
a constant, or in terms of those other variables not yet eliminated), and the remain¬ 
ing variables are evaluated from the equations used earlier. If this process fails (for 
example, if two contradictory equations result), the equations were inconsistent and 
there are no solutions. 

We shall formalize this process. We define the augmented matrix of the system 
Ax = b to be the matrix [A \ b ] formed by adjoining the vector of constants b to 
the matrix A of coefficients. The vertical line indicates the division between the two 
types of element. The augmented matrix of the system given above is 


2 

4 

-3 

4 

0 

-2 

4 

6 

1 

-1 

4 

10 


The first step is to select a variable to eliminate. This is equivalent to choosing 
a column in the augmented matrix and selecting a row—an equation—to use for 
substitution. The only requirement is that the matrix has a nonzero entry in that 
row and column. Let us choose row 3, column 1, representing variable v in the first 
equation. It will be convenient to interchange rows 1 and 3, so that we are operating 
on the (1,1) entry. This is equivalent to rewriting the equations in a different order. 
The matrix is now 


1 

-1 

4 

10 

0 

-2 

4 

6 

2 

4 

-3 

4 


Rl R3 
R3 Rl, 


where the annotations mean the new row 1 is the old row 3 and the new row 3 is the 
old row 1 . 

Now substitute for x in the other equations. No action is required in the second 
equation, but v must be eliminated from the third. So we subtract twice the first 
row from the third row. This yields precisely the equation we would get if we used 
equation 1 to substitute for v in equation 3, but for consistency we have kept all the 
variables on the left-hand side of the equation. The augmented matrix becomes 


”1 

-1 

4 

10 

0 

-2 

4 

6 

0 

6 

-11 

-16 


R3 <r- R3-2R1, 











8.3 Systems of Linear Equations 279 


where the legend means the new row 3 is ( the old row 3) —2 (the old row 1). (When 
we say old we are referring to the preceding augmented matrix, not to the original 
one.) 

Now multiply row 2 by — ^. Then eliminate the 6 y from the third equation. The 
result is 


1 

-1 

4 

10" 

0 

1 

-2 

-3 

0 

0 

1 

2 


R2 ^ R2 

R3 +- R3-6(^R2 


This could have been broken into two steps. 

So far we have done the equivalent of substituting in the later equations. Now we 
substitute back to find the values. We know from the third equation that z = 3. To 


substitute this in the earlier equations, we 
times row 3 from row 1: 


"1 

-1 

0 

2 

0 

1 

0 

1 

0 

0 

1 

2 


Next add row 2 to row 1: 


1 

0 

0 

3 

0 

1 

0 

1 

0 

0 

1 

2 


add twice row 3 to row 2 and subtract four 

R1 <r- Rl -4R3 
R2 R2 + 2R3. 

Rl <r- RI + R2. 


The resulting array can be translated into the equations 
x = 3, y = 1, z — 2. 


Elementary Operations 

In our example, we used three operations: 

El: permute the rows of the matrix; 

E2: multiply a row hy a ( nonzero ) constant ; 

E3: add a multiple of one row to another row. 

We shall call them elementary row operations. Their importance comes from the 
following theorem. 

Theorem 55. Suppose P is the augmented matrix of a system of linear equations and 
Q is obtained from P by a sequence of elementary row operations. Then the system 
of equations corresponding to Q has the same solutions as the system corresponding 
to P. 
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Proof. Operation El does not change the system. 

The equations a • x = b and ka x — kb have the same solutions when k is 
nonzero. So E2 does not change the solution set. 

Finally, suppose xo is a solution of the system of two equations a x = b , 
a' • x = b' . Then a • xo = b and • Jto = b' 9 so (« + £a r ) • jco = b + kb' , 
and vo is a solution of the system a • x = b, a + ka' x = b + kb'\ conversely, any 
solution of the second system is a solution of the first set. If the same further set of 
equations is appended to each system, the two resulting systems still have the same 
solutions, so E3 does not change the solution set. □ 


The Solution Algorithm 

It is clear that the repeated application of the three steps to the augmented matrix 
will provide a solution. The algorithm for solving systems of linear equations works 
as follows. 

Stage 1 

1. Find the leftmost column in the matrix of coefficients that contains a nonzero 
element, say column j. Use El to make the row containing this element into the 
first row, and E2 to convert its leftmost nonzero element to 1. This is called a 
leading 1. Then use E3 to change all entries below the leading 1 to zero. That is, 
if the (/, j) entry is a[j , then subtract aij x (row 1) from row j. 

At this stage we say column j is processed. Processed rows are not disturbed in 
the first stage. 

2. Find the leftmost unprocessed column in the augmented matrix that contains a 
nonzero element, say column k. Use El to make this row the first row under the 
processed row(s), and E2 to convert its leftmost nonzero element to 1, another 
leading 1. Use E3 to change all entries below the leading 1 to zero (but do not 
change the processed row or rows). Now column k is also processed. 

3. If you have not either reached the last column of coefficients (the vertical line) or 
the bottom of the matrix, go back to step 2, make another leading 1, and proceed 
from there. 


Stage 2 

4. Choose the bottommost leading 1 and eliminate all elements above it in its col¬ 
umn by use of E3. Do the same to the next leading 1 up, then the next, until you 
reach the top. 

The process is now finished. We shall illustrate the interpretation of the results 
with an example. 
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Sample Problem 8.10. Solve the system 


by row operations. 


2x + 2y + 4z = 0, 
3x — y + 2z = 1, 
8v + 8z = 2, 


Solution. The augmented matrix is 


2 

2 

4 

0" 

3 

-1 

2 

1 

8 

0 

8 

2 


At step 1 we choose the element in the (1, 1) position and divide row 1 by 2: 


1 

1 

2 

0 

3 

-1 

2 

1 

8 

0 

8 

2 


1 

7?1 -R 1 (using E2). 


Then we eliminate the rest of column 1: 


"1 

1 

2 

0 




0 

-4 

-4 

1 

R2 < 

- R2 

-3x Rl 

0 

-8 

-8 

2 

R3 < 

- R3 

-SxRl 


(using E3) 
(using E3). 


In step 2 we choose the (2, 2) position and divide by —4, then eliminate the 
entries below the (2, 2) position, obtaining successively 


1 

0 

0 

1 

0 

0 


1 

1 

-8 

1 2 

1 1 

0 0 


R2 


— x R2 
4 


R3 <r- R3 - 8 x R2 


(using E2), 


(using E3). 


There are no further numbers available for leading l’s, so we move to step 4. We 
use the (2, 2) element: 


”1 

0 

1 

l n 
4 

Rl * 

- Rl — R2 (using E3). 

0 

1 

1 

1 

4 



_0 

0 

0 

0 _ 




The process is finished. There is no restriction on z- The final augmented matrix 
converts to the system 

1 

x + z= 

4 

1 

y + z = - 4 > 

(the third equation can be ignored) and the final solution can be expressed as 
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1 

4 


z, 


y = - 



any real number z. 


It is important to notice that the sequence of calculations is completely deter¬ 
mined by the matrix of coefficients, the left-hand part of the augmented matrix. 


If the column corresponding to a variable receives a leading 1, we shall call that 
variable dependent ; the others are independent. One standard way of recording the 
answer is to give an equation for each dependent variable, with a constant and the 
independent variables on the right; the independent variables take any real number 
value. Another way to express the above solution would be to use set notation 



z eR 


or perhaps 



t e R 


In this case t is a parameter. 

Sometimes there will be no solution to a system of equations. As we said above, 
the equations are then inconsistent. 


Sample Problem 8.11. Solve the system 

2x + 2y + 4z = 0, 
3x - y + 2z = 1, 
8v -T 8z = 3. 

Solution. The augmented matrix is 


2 

2 

4 

0 

3 

-1 

2 

1 

8 

0 

8 

3 


The left-hand part of this equation is the same as in Sample Problem 8.10, so 
we go through the same steps, making the appropriate changes to the right-hand 
column. At the end of Stage 1, we have 


1 

1 

2 

0 " 

0 

1 

1 

1 

4 

0 

0 

0 

1 


When we convert back to equations, the third row gives the equation 

0 = 1 , 

which is impossible. No values of x, y, and z make this true, so the equations are 
inconsistent. There is no need to implement Stage 2. 
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In set-theoretic terms, we can report that the solution set is 0. 

Exercises 8.3 

In Exercises 1 to 14, the augmented matrix of a system of equations is shown. As¬ 
suming the variables are x,y,z, what is the solution of the system? 



"1 

0 0 

3' 



1 

0 

0 

-1 

1 . 

0 

1 0 

1 


2. 

0 

1 

0 

-1 


_0 

0 0 

0 



_0 

0 

1 

-1 


"1 

0 1 

2" 


"1 

1 

0 

2 

3. 

0 

1 - 

1 

1 


4. 

0 

0 

1 

-1 


0 

o 

o 

0 


0 

0 

0 

0 







_0 

0 

0 

0 


"1 

0 1 

2' 



"1 

0 

0 

3" 

5. 

0 

1 2 

1 


6. 

0 

1 

0 

1 


_0 

0 0 

2 



_0 

0 

1 

2 


”1 

0 1 

2 



"1 

0 

0 

2" 

7. 

0 

1 1 

1 


8. 

0 

1 

0 

1 


_0 

0 0 

1 


0 

0 

1 

1 







_0 

0 

0 

1 


' 1 

1 

1 

4 



"4 

2 


-3 

9. 

-1 

2 

1 

-i 

10. 

3 

-1 


-1 


3 

2 

4 

6 


1 

-7 


3 


In Exercises 15 to 36, solve the system of equations. 



"4 

3 

1 

11“ 


0 

1 

-1 

-1 

ii. 

2 

-2 

4 

2 

12. 

1 

0 

1 

1 


1 

3 

-2 

5 


1 

2 

-1 

0 


"2 

-1 

1 

-2" 


“2 

3 

1 

-1 

13. 

3 

2 

3 

8 

14. 

1 

2 

1 

0 


1 

-1 

-1 

0 


3 

2 

-1 

-4 


15. 2x + 6y = 6, 
4x + 11 y = 10. 


16. 3x - 2y = 4, 
—6x + 4y = 2. 


17. 2x + 3y = 5, 
4x + 6y = 10. 


18. x — y = 3, 
2x + y = 3. 
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20 . 


19. 3* — y = 4, 

6x — 2 y = 2. 

20. 3x — 2y = — 1, 

—6x + 4y = 2. 

21. x — 2y = 4, 

—3x -4y = -2, 

2x + 3y = 1. 

22. 3x + 2y = 4, 

2x + 3y = 1, 

5x - 4y = 14. 

23. x + 2 y + z = 3, 
x + y - 2z = 2. 

24. x + y + 3z = 2, 

4x + 2y + 2z = 10. 

25. x + y + z = 3, 
x ~h 2y 2z — 3, 
x + y + 2z = 1. 

26. 2x + 4z = 6, 

2x + y + 5z = 7, 
x - y + z = 2. 

27. 2x + 2z = 2, 

x + 2y + 6z = 3, 

2x — 2y = 1. 

28. x + z = 4, 

x + 4y + z = 7, 
x - 2y + z = 3. 

29. x + 2y + z = —1, 

2x + 3y — 2z = 7, 

—2x + 2y — 3z = —2. 

30. x + y — z = 4, 

3x + 4y — 7z = 8, 

— y + 4z = 4. 

31. x y = 2, 

x - y + 5z = 3, 

— 3x — 3y + 2z = —6. 

32. x + 2y + 3z = 4, 

4x + 5y + 6z = 16, 
lx 4~ 8 y 4~ 9 z — 28. 

33. 3x — 2y — 8z + It = 1, 
x + y - z - t = 3, 
x — y — 3z + 3t = —1. 

34. 2x — y + z — 3t = 2, 

— 4x — 3y + f = 1, 

2x — 6y + 3z — St = 4. 

35. x + 2y -f- 3z 4 1 = 8, 

x — 3y + 4z + 4t = 8, 

2x — 2 y — z -\-1 = —3, 
x — ly — lz — 2>t = —11. 

36. 2x + 2y — 2z + 3t = 2, 

4x — 2 y — z ~\~ t = —4, 

6x — 3z + 4t = —2, 

2x + 8y — 5z + St = 10. 


8.4 More About Linear Systems and Inverses 

Classification of Systems of Equations 

We say that a matrix is in reduced row echelon form if it has the following properties: 


All rows with every element zero lie at the bottom of the matrix; 
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• The leftmost nonzero element of each row is a 1 (called a leading 1); 

• Every other entry in the same column as a leading 1 is zero; 

• Each leading 1 is to the right of the leading 1 in the preceding row. 

So the process of solving a system of equations consists of converting its augmented 
matrix to reduced row echelon form. For this reason we often speak of reducing the 
matrix (or reducing the system). 

The system is inconsistent when the reduction process results in a leading 1 in the 
right-hand column of the augmented matrix. In this case there is no need to complete 
the algorithm. 

Suppose the system is consistent. There are two possibilities. 

Sometimes the reduced matrix will have a leading 1 in each column so that it is 
an identity matrix, possibly with some zero rows appended. In this case there will be 
exactly one solution. 

Alternatively, there will be some columns that contain no leading 1. The corre¬ 
sponding variables are independent, and can take any real value, so the system has 
infinitely many solutions. 

To illustrate these possibilities, consider the following three sets of equations: 

x+y = 2, x + y = 2, x + y = 2, 

3x + 3y = 5, 2x + 3y = 5, 2x + 2y = 4. 

The matrix forms are as follows: 


"1 

1 

2" 


"1 1 

2" 


"1 

1 

2" 

3 

3 

5 

’ 

2 3 

5 

’ 

2 

2 

4 


In each case subtract twice row 1 from row 2, obtaining 


"1 

1 

2 " 


"1 

1 

2 " 


"1 

1 

2 " 

_0 

0 

1 

’ 

_0 

1 

1 _ 

’ 

_0 

0 

0 _ 


The first system is inconsistent and there are no solutions. The second system 
can be reduced further; subtracting row 2 from row 1, we get 


"1 

0 

r 

_0 

1 

i 


and the solution is x = 1, y = 1. In the third system, y can take any value, and 
correspondingly x — 2 — y. 


So we have a classification of systems of equations into three classes. A system 
can have no solutions, exactly one solution, or infinitely many solutions. Alterna¬ 
tively, we can say that the solution set is empty, a singleton set, or infinite. There can 
be no system whose solution set is finite but has more than one member. 
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Calculating the Inverse 


The inverse of a square matrix can be calculated by the algorithm used for solving 
equations. Suppose A is an n x n matrix with inverse B. Write b\, Z> 2 , •.., b n for 
the columns of B , and write u \, « 2 , • • •, u n for the columns of the identity matrix of 
order n. Consider the equation AB = I. Column j of the left-hand side is Abj. So 
the equation is equivalent to the set of n systems 

Ab\ = «i, Ab 2 = U 2 , ..., Ab n — u n . 

If all these systems have solutions, then the inverse is formed by putting the solution 
vectors next to each other in order. If any system has no solution, there is no inverse. 

To solve Abj = uj , we reduce the augmented matrix [A \ uj] to reduced row 
echelon form. The same steps will produce this result, no matter what vector is on 
the right-hand side. It follows that we can carry out the reduction simultaneously for 
all n systems of equations. So we have the following technique for inverting an nxn 
matrix A. 

Row reduce the matrix [A \ I n ]. If the resulting matrix has form [I n \ B] then A is 
invertible, and B is A -1 . Otherwise, A is singular. 

It follows from this that if a matrix has a row with every entry zero, it must 
be singular. This is also true if the matrix has a column with every entry zero. For 
example, if A has every entry of its first column zero, then BA has every element zero 
in its first column for any choice of B , so the equation A -1 A = I cannot possibly 
be true—it must fail in the (1, 1) position. 


Sample Problem 8.12. For the following matrices, find the inverse or show that 
the matrix is singular: 


'1 

2 

-1" 


"2 

3 

2" 

0 

-2 

1 

, B = 

3 

-1 

2 

1 

1 

-1 


1 

7 

2 


Solution. For A, we have 


1 

2 

-1 

1 

0 

0 


”1 

2 


— 

1 

1 

0 

0 

0 

-2 

1 

0 

1 

0 

=> 

0 

-2 


1 

0 

1 

0 

1 

1 

-1 

0 

0 

1 


_0 

-1 


0 

-1 

0 

1 








”1 

2 


— 

1 

1 

0 

0 








0 

1 


0 

1 

0 - 

-1 








_0 

-2 


1 

0 

1 

0 








”1 

0 

— 

1 


-1 

0 

2 







=> 

0 

1 

0 


1 

0 - 

-1 








_0 

0 

1 


2 

1 - 

-2 








”1 

0 

0 

1 


1 

0 ' 








=> 

0 

1 

0 

1 


0 

-1 









0 

0 

1 

2 


1 

-2 






















8.4 More About Linear Systems and Inverses 287 


So A has inverse 


For B , we get 


1 1 
1 0 
2 1 


2 

3 

2 

1 

0 

0" 


"1 

7 

2 

0 

0 

r 

3 

-1 

2 

0 

1 

0 


2 

3 

2 

0 

1 

0 

1 

7 

2 

0 

0 

1 


_3 

-1 

2 

1 

0 

0_ 








"1 

7 


2 

0 

0 








0 

-11 


-2 

0 

i 








0 

-22 


-4 

1 

0 


1 7 2 

0 -11 -2 
0 0 0 

and the zero row on the left tells us that B is singular. 


1 -2 


1 

-2 

1 


Practice Exercise. In the above calculations, identify the steps that have been 
taken at each stage. (For example, for A, the first step was R3 <— R3 — Rl.) 


This method can be used to get a general solution for the inverse of a 2 x 2 matrix. 

Theorem 56. The matrix 


A = 


a b 
c d 


is singular if ad — be = 0. Otherwise it is invertible, with inverse 


1 


(ad — be) 


d -b 
—c a 


( 8 . 1 ) 


Proof. If a = c = 0, then ad — be = 0, and A has no inverse. 

First, suppose a and c are both nonzero. The inverse procedure is 


a b 

1 

0" 

c d 

0 

1 


ac be 
ac ad 


c 0 
0 a 


ac be 

0 ad — be 

ac 0 

0 ad — be 


Rl 
R2 

c 0 
—c a 


cR 1 

aR2 

R2 + 


R2- Rl 


(\ I be x abc "| 

1 ' ad—he' ad —, 


ad—be* 
—C 


ad—be 

a 


Rl 


Rl - 


be 


R2. 


ad—be 

If ad — be = 0, then we are finished and there is no inverse. Otherwise, notice that 
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so we have 


1 + 


be 


ad — be 


ad — be + be 
ad — be 


ad 


ac 



0 


acd 

abc 



ad—be 

ad—be 

ad 

— be 

—c 

a 

"1 

0 


d 

b "I 

ad—be 

ad—be 

0 

1 


c 

a 


ad—be 

ad—be - 


ad — be' 


Rl 

Rl 


:R 1 

l 


ad—be 


R2 


as required. 

If a / 0 and c = 0 the calculations are simpler. We obtain the inverse 


- l 

a 


0 


b_ ~ 
ad 
t 

d J 


and this is the form taken by (8.1) when c — 0. 

The case where a — 0, c / 0 is left as an exercise. □ 

The number ad — be is called the determinant of the matrix A, written det(A). 
Determinants may be defined for square matrices of any order, and it is a general 
theorem that a matrix is invertible if and only if its determinant is nonzero. For more 
details, see books on linear algebra. 


Sample Problem 8.13. Find the determinants of the following matrices and use 
them to find their inverses, if possible: 


Solution. det(A) = 3-2—1-2 = 4, so 



det (B) = 2-1 — (—2)-(—1) = 0, so B has no inverse. 


Practice Exercise. Repeat for 


C = 


1 

3 


1 

4 ’ 



4 

2 


Using the Inverse 

Consider the system of equations Ax = b , where A is an invertible matrix. Mul¬ 
tiplying by A -1 , x = A -1 Ax = A~ l b, so the equations have the unique solution 
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A~ l b. This can be used to solve the equations. This technique is not usually helpful 
in practical situations because the process of finding the inverse takes at least as long 
as solving the equations, but it is useful when there are several sets of equations with 
the same left-hand sides, or when the inverse is already known. It is also important 
in theoretical studies. 

Sample Problem 8.14. Solve the following systems: 

x + 2y — z = 3, x +2y — z = —1, 

—2 y + z = 1, —2 y + z = -2, 

x + y - z = 0, x + y - z = 2. 

Solution. We saw in Sample Problem 8.12 that the matrix of coefficients has 

inverse 

1 1 0 
10-1. 

_2 1 — 2 _ 

Now 


1 1 0 


3 


4 

1 0 -1 


1 

= 

3 

<N 

1 

<N 

_1 


0 


7 


so the first system has solution x = 4, y = 3, z = 7. 


1 1 0 


"-1" 


-3 

1 0 -1 


-2 

= 

-3 

<N 

1 

<N 

_1 


2 


-8 


and the second has solution x = —3, y = —3, z = —8. 

Practice Exercise. Solve the following systems: 

x + 2y — z = 2, x + 2y — z = 4, 
—2 y + z = 2, —2y + z = —1, 

x + y - z = 1, x 4- y — z = 3. 


Exercises 8.4 

In Exercises 1 to 6, the reduced row echelon form of an augmented system is shown. 
Say whether its solution set is empty, singleton, or infinite. 



1 

0 

3 

1 


1 

0 

0 

2 

1 . 

0 

1 

1 

1 

2. 

0 

1 

0 

1 


0 

0 

0 

1 


0 

0 

1 

-1 
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5. 


"1 

0 

0 

r 


"1 

1 

3 

r 

0 

1 

0 

2 

4. 

0 

0 

0 

2 

0 

0 

0 

0 


0 

0 

0 

0 

"1 

0 

0 

0" 


"1 

0 

2 

3" 

0 

1 

0 

0 

6. 

0 

1 

1 

3 

0 

0 

1 

0 


0 

0 

0 

0 


In Exercises 7 to 24, use row reduction either to find the inverse of the given matrix 
or to show that the matrix is singular. 


7. 


11 . 


13. 


15. 


17. 


19. 


21 . 


23. 


-2" 



8. 

"2 

-2" 


0 




_3 

—3 _ 


7" 




10. 

"1 

1 


5_ 




1 

0.5 _ 


2" 

1 




12. 

" 3 

1 

- 




-4 

-2 

0 


1 



"0 

1 O' 

1 

0 


14. 

0 

0 

1 

0 


1 



1 

o 

o 

3 


2" 



"9 

8 ' 

7 

2 


2 


16. 

6 

5 < 

4 

1 


3 _ 



_3 

2 

1 _ 

1 



1 


"1 

-2 

3 

1 



-2 

18. 

3 

5 

1 

- 

3 


-3 


_6 

4 

2 

3 


3" 



"2 

3 

-1 

4 


3 


20. 

4 

2 

3 

3 


4 



2 

7 

-6 


1 


-1" 


"1 

0 

2 


— 

1 

7 

22. 

2 

-1 

3 


2 


-8 


4 

1 

8 

2 


3" 



"1 

2 : 

3" 

5 


1 


24. 

2 

5 : 

5 

7 


4 



3 

8 ! 

8 


In Exercises 25 to 30, find the determinant, and use it to invert the matrix or show 
that it is singular. 
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"7 

4" 

26. 

"6 

4" 

27. 

"-2 2 

2 

3 _ 

_ 3 

2 

_ 2 3 

"3 

1 

5" 

29. 

"4 

8 

2" 

30. 

"6 4" 

4 

4 

7 5 


31. (i) Prove that the following matrices are inverses: 


"3 2" 


1 -2" 

1 1 

’ 

-1 3 


(ii) Use part (i) to solve the following systems: 

(a) 3x + 2y = 4, (b) x — 2y = — 1, 

x + y = 1; -x + 3y = 2. 

32. (i) Prove that the following matrices are inverses: 


"2 -2 r 


3 -3 -2 

1 0 1 


1 -1 -1 

1 -3 0 


— i 

<N 

CO 

1 


(ii) Use part (i) to solve the following systems: 

(a) 2x — 2y + z = 3, (b) 3x — 3y — 2z = 2, 

x+z = 2, x — y — z — — 1, 

x — 3y = 1; —3x + 4y + 2z = 2. 

33. Suppose a system of equations has 0 for every right-hand side. Can it be incon¬ 
sistent? 

34. Prove Theorem 56 in the case where a = 0 (and c ^ 0). 


8.5 Adjacency Matrices 


Representing a Relation by a Matrix 


Suppose A = {a\,... ,a m } and 5 = {b\,..., b n } are two finite sets, and let a be a 
relation from A to B. We define the adjacency matrix of a to be the m x n matrix 
M a with (/, j) entry 


{ 1 if ai a bj , 
0 otherwise. 


Obviously this definition depends on the order in which we take the elements of the 
two sets. If you change the order of the elements of A, this will reorder the rows 
of M a , while changing the order of the elements of B will reorder the columns of the 
matrix. 
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Sample Problem 8.15. Suppose A = {1,2}, B = {1,2, 3} and a is the relation 
from A to B defined hy 

a = {(1,1), (1,3), (2, 2), (2, 3)}. 

What is the adjacency matrix of a ? 


Solution. 


M a 


1 0 1 
0 1 1 


Practice Exercise. For the same sets A and B , what is the adjacency matrix of 
the relation 


7T = {(1,2), (1,3), (2,1), (2, 2)}? 


Conjunction of Adjacency Matrices 


Again suppose A = {a\, ..., a m } and B = {b\, ..., b n }, and consider also a third 
set C = {ci, C 2 , ..., c p }. Suppose a is a relation from A to B and f is a relation 
from B to C. Then Mp is an n x p matrix with (/, j) entry 


Pij = 


1 if bificj, 

0 otherwise. 


We define the conjunction of M a and Mp, written M a V Mp, to be the m x p 
array with (/, j ) entry 


Yij = 


1 if Y!i = \ a ikPkj > 1, 
0 otherwise. 


( 8 . 2 ) 


(If M a is m x n and Mp is n' x p, where n n\ then M a V Mp is not defined.) 


Sample Problem 8.16. Let A, B, and a be as in Sample Problem 8.15. Further 
define C = {1, 2, 3, 4} and 

P = {(1,2), (1,3), (2,1), (2, 4), (3, 3)}. 

Find M a \/ Mp. 


Solution. The matrix of ft is 

Mp 


0 110 
10 0 1 
0 0 10 


To find M a \/Mp it suffices to find the ordinary matrix product M a Mp and replace 
any non-zero element by 1 


0 12 0 
10 11 


and M a V Mp = 


1 

0 


1 

1 


0 

1 


M a Mp = 
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Practice Exercise. If A, B, C, and a are as stated and 

r = {(1,1), (1,3), (2, 2), (3,1), (3,4)}, 

find M a v M x . 


Notice that a/3 = {(1, 2), (1, 3), (2, 1), (2, 3), (2, 4)} and M a v Mp = M a p. We 
prove that this happens in general. 

Theorem 57. Let A, B, C, a, and ft be as before. Then 

(i) M a p = M a V Mp , 

(ii) M a -i = Mj, transpose of M a , 

(iii) V M T a . 

Proof, (i) Let /x = a/3, so that is the m x p array (cr i; ), where 

{ 1 if (2; picj , 

0 otherwise. 

We see that the conjunction V Mp is also an m x p array. Suppose its (/, j) 
entry is defined by y, as in (8.2). Now yij = 1 if and only if 

n 

J2mPkj > l, 

that is, if and only if ciikPkj = 1 for at least one value of k. This is true if and only if 
ota = 1 = Pkj for at least one k , or equivalently if and only if aiabk and bkfcj for 
at least one k. But this is just the statement that aiOtfcj , so &ij = 1. 

Conversely, Gij = 1 if and only if aiOifcj , which is equivalent to the existence 
of at least one bk such that aiabk and bkfcj. But now aa = 1 = fkj, so yij = 1. 

Hence M a V Mp = M a p. 

(ii) Let M t = (€ij) nxm , where €ij = aji . Then is the correct size array 
for a -1 : B —>► A, as M a -i is ft x m. Suppose the (/, j) element of M a ~i is ; 
then = 1 if and only if bia~ l aj , or equivalently ajabi, so = 1 if and only if 
eij — 1, which implies M T = M a -i. 

(iii) M'l p = by (ii) 

= Mp-\ a -i by Theorem 22 
= Mp -1 v M a -i by (i) 

= MjvMj by (ii). □ 

We observe some of the properties of these matrices. If a is a relation on a set A 
with m elements, then M a is m x m. If a is reflexive, then M a has l’s down its diag¬ 
onal; if a is symmetric, then M a is symmetric in the usual matrix sense ( aij = aji)\ 
if a is transitive and M aoi = M a V M a has 1 in the (/, j) position, then so does M a . 
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Adjacency Matrices of Graphs 

If the relation a is represented both by the adjacency matrix M a and by a graph G, 
then M a is also called the adjacency matrix of the graph G. This is such a funda¬ 
mental idea that we define it separately. 

Suppose G is a graph with v vertices {x \, X 2 , ..., x v }. Then the adjacency matrix 
Mq of G is the v x v matrix Mg with (7, j) entry 



1 if Xi ~ xj , 

0 otherwise. 


The adjacency matrix depends on the order in which the vertices are taken. However, 
changing the order of the vertices results in a matrix that can be derived from the 
original by carrying out a reordering of the rows, and then carrying out an identi¬ 
cal reordering of the columns. Many properties of matrices are left unchanged by 
operations like this. 

The adjacency matrix of a graph is necessarily symmetric. 

If G has e edges {yi, yi, • • •, y^}, then the incidence matrix Ng of G is the v x e 
matrix with (/, j) entry 



1 if vertex Xi is incident with edge aj , 
0 otherwise. 


Sample Problem 8.17. What are the adjacency and incidence matrices of K 2 ,3 ? 

Solution. We take the vertices of ^ 2,3 to be the sets {1,2} and {3, 4, 5}. Taking 
the vertices in numerical order, the adjacency matrix is 


0 0 111 
0 0 111 
110 0 0 
110 0 0 
110 0 0 


If we take the edges in order (1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (2, 5), the inci¬ 


dence matrix is 


1110 0 0 
0 0 0 1 1 1 
10 0 10 0 
0 10 0 10 
0 0 1 0 0 1 


Practice Exercise. What are the adjacency and incidence matrices of K 4 I 
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The powers of the adjacency matrix of a graph G give information about G. 
Suppose Mg = Write x\, X 2 , ... for the vertices of G, and assume i ^ j. If 

= mkj = 1, this means there are edges jcjjcjt and XkXj, so there is a path from 
JCi to xj passing through x^. The number of two-edge paths from jtj to xj equals 
the number of vertices Xk for which this is true. The adjacency matrix has entries 0 
and 1, so we have actually observed that m^my = 1 when there is an xix^Xj path 
and 0 otherwise. So m ik r ^kj equals the number of different two-edge paths from 
Xi to Xj . The ith diagonal entry is J2 m ikMki As m ik = 1 or 0, mj k = m**, 

so m ik r ^ki equals the number of times m^ = 1, the degree of x\. 

Theorem 58. Mq has the degrees of the vertices on the diagonal, and the number of 
paths of length 2 joining the relevant vertices in its other positions. 

For n larger than 2, the off-diagonal entries of give the number of walks 
between the relevant vertices (not necessarily paths; for larger n , repeated vertices 
can occur). The situation with the diagonal entries is more complicated (see, for 
example, Exercise 8.5.23). 


Exercises 8.5 

In Exercises 1 to 7, write down the adjacency matrix of the relations. These relations 
were all defined in exercises in Section 4.1. 

1. Exercises 4.1.1 and 4.1.2, relations a, f>. 

2. Exercises 4.1.3 and 4.1.4, relations y, 8. 

3. Exercises 4.1.5 and 4.1.6, relations e, 0. 

4. Exercise 4.1.7, relations a, f. 

5. Exercise 4.1.8, relations p, a. 

6. Exercises 4.1.1 and 4.1.2, relations 

7. Exercise 4.1.8, relations per, op. 

8. Suppose a is a relation between finite sets A and B. What is the relationship 
between M a and 

9. Let A = {eq, #2,..., B = {b\, b 2 , ..., b n }, C = [c\, C 2 ,..., c p } and 
D = {d \, d 2 , ..., d q } be finite sets and a , f and y be relations from A to B, B 
to C, and C to D, respectively. Show that M a V (Mp V M y ) = ( M a V Mp) V M y 

(i) using the fact that composition of relations is associative; 

(ii) working directly from the definition of matrix conjunction. 

10. Let a and f be relations from A to B. What can you say about M a and Mp if 
P c oil 

11. Suppose a is a relation from A to itself. What can you say about M a if a is 
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(i) irreflexive? (ii) antisymmetric? (iii) atransitive? 

In Exercises 12 to 18, write down the adjacency matrix and the incidence matrix of 
the graph. 

12. The complete graph K$. 

13. The cycle C§. 

14. The path P$. 

15. The wheel W5 . 

16. The union C 3 U C 3 . 

17. The star K\s. 

18. The Petersen graph. 

In Exercises 19 and 20, use the adjacency matrix to answer the question. 

19. Find the number of paths of length 2 joining two vertices in the complete 
graph K 6 . 

20. Find the number of paths of length 2 joining two vertices in the wheel We. (You 
will need to consider separately the various types of pairs of vertices.) 

21. Suppose G is a regular graph. What does this tell you about Mg? 

22. Suppose G is any graph. What is the relationship between Mq and M-^l Com¬ 
pare your answer to Exercise 8 . 

23. What information do the diagonal elements of M^ give about the graph G? 


9 


Number Theory and Cryptography 


When we think of secret information, the first image in our minds may be the spy in 
a dirty raincoat lurking around comers in the sleazy part of a central European city. 
But nowadays secret messages are more likely to be sent by computer, or broadcast. 
The emphasis is no longer on keeping the messages secret. The enemy will almost 
certainly be able to read your messages; your aim is to stop your adversaries from 
understanding them. 

Encryption, or secret writing, has always been important in warfare. In antiquity, 
when very few people could read at all, quite elementary precautions could keep 
a message safe. Over the years more and more sophisticated techniques have been 
developed. 

In general, breaking a code (or decryption) involves a great many trial-and-error 
computations. With the development of the computer there has been a spectacular 
increase in the speed at which such computations can be performed, and better and 
better techniques of encryption have been developed. 

Several of the techniques of encryption and decryption involve elementary num¬ 
ber theory, so we begin by studying primes, factors, divisors, and modular arithmetic. 
We survey classical methods of encryption, and then look at the modern study of 
public-key cryptography. 


9.1 Some Elementary Number Theory 

Primes and Divisors 

We start by recalling some basic ideas that we encountered in Section 1.1. An integer 
a is a multiple of an integer b if there exists some integer c such that a = be, and we 
say b and c are factors or divisors of a. A prime number is a positive integer a , other 
than 1, that has no divisors other than 1 and a itself. The integers are made up of four 
disjoint classes: 
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- the zero , 0; 

- the units , 1 and —1; 

- the primes , consisting of the prime numbers and their negatives; 

- the composite numbers, all other integers. 

A positive composite number can be written as a product of prime numbers. For 
example, the composite number 45 can be expressed as 3 • 3 • 5. More importantly, 
this is the only way to express 45 as a product of primes: multiply two copies of the 
prime 3 and one copy of the prime 5. This is true for every positive integer, and is 
called the Fundamental Theorem of Arithmetic: 

Every positive integer can he expressed in exactly one way as the product of prime 
numbers, provided the order of the factors is disregarded. 

The expression is called the prime factor decomposition of the number con¬ 
cerned. For example, we say the prime factor decomposition of 45 is 3 • 3 • 5, or 
3 2 -5. 

The Fundamental Theorem of Arithmetic is also called the Unique Factoriza¬ 
tion Theorem. It is very important for understanding how the integers work. Unique 
factorization is the reason we do not regard 1 as a prime: if it were, then we could 
write 

45 = 3- 3- 5 = l-3-3-5 = l-l-3-3-5 = -- -, 

and the representation would not be unique. It is also why we defined prime numbers 
to be positive integers; we do not want to worry about factorizations like 

45 = 3 • 3 • 5 = (-3) • (-3) • 5. 

In case you were wondering about 1, it is the product of no primes, and can only 
be expressed that way. 

Sample Problem 9.1. Express the numbers 48, 33, 153, 59 as products of primes. 

Solution. By repeated division we see that 48 = 2 • 24, 24 = 2 • 12, 12 = 2 • 6, 
6 = 2 • 3; 3 is prime, so 48 = 2 4 • 3. Similarly 33 = 3 • 11; 153 = 3 2 • 17; 59 is 
prime. 

Practice Exercise. Express the following numbers as products of primes: 36, 
132, 47, 55. 

Sample Problem 9.2. List all positive divisors of 228. 

Solution. 228 = 2 2 • 3 • 19 so the divisors are all the numbers of the form 2 X 3 y 1 9 Z , 
where x is 0, 1, or 2; y is 0 or 1; and z is 0 or 1. So the list is 1, 2, 4, 3, 6, 12, 19, 
38,76, 57, 114, 228. 
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Practice Exercise. Find all positive divisors of 246. 

To work out the prime factor decomposition of a number a, it is necessary to di¬ 
vide by all the primes up to a. However, this process can be speeded up: for example, 
if a does not end in 5 or 0, it is not divisible by 5. Similarly, any number divisible by 
2 must end in 0, 2, 4, 6, or 8. Another useful test is to add all digits of a; if this sum 
is not divisible by 3 then a is not divisible by 3, and if the sum is not divisible by 9 
then a is not divisible by 9. 

To test whether the number a is prime , it is not necessary to test whether a has a 
prime factor greater than *Ja. (Can you see why?) 


Greatest Common Divisors and the Euclidean Algorithm 

If a and b are any two positive integers, the greatest common divisor or GCD of a 
and b is the largest integer that is a divisor of both of them. For example, the greatest 
common divisor of 6 and 9 is 3. Two numbers with greatest common divisor 1 are 
called coprime , or relatively prime. We denote the greatest common divisor of a and 
b by gcd(a, b), or, if there is no danger of confusion, simply (a, b). 

As we saw in Section 1.1, one way to find the greatest common divisor is to write 
down the prime factor decomposition of each number, and compare them. However, 
there is a more general technique available. We shall look at it now. 

Suppose a and b are any two positive integers. If a is not a multiple of b , then 
dividing b into a will result in a quotient and a remainder, there exist unique integers 
q and r, where q > 0 and 0 < r < b, such that 

a = qb + r; (9.1) 


q is the quotient and r the remainder on division by b. 

Now suppose d is any common divisor of a and b. Then d is also a divisor of r, 
because r = a — qb and d divides each term on the right. We use this simple fact to 
construct a method to find the greatest common divisor of two positive integers. 

Given positive integers a and b with GCD d , suppose dividing b into a as in 
Equation (9.1) results in quotient q\ and remainder r\. Then 

r\ = a — q\b, 

so obviously d must divide r\. So d is a common divisor of b and r\. If r\ is nonzero 
we repeat the step, using b and r\, to get a new remainder, r 2 say, and continue as 
follows: 


a = q\b + ri, 
b = + 7*2, 
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r\ = qm + r 3 , 


At each step qt > 0, 0 < r i+ i < r* and d must divide the new remainder. 

In this way we get a sequence of nonnegative integers r\, r^, r 3 ,..., with the 
properties that d divides every element and V[ > r i+ \ for each i. Such a process 
cannot continue indefinitely because there are not infinitely many positive integers 
less than r\, so it must sometime happen that r n +\ = 0 (so that the division cannot 
be repeated). We eventually get 


Tn —1 — qn r n •> 

so r n divides r n -\. Since 2 = q n -\r n -\ + r n it follows that r n divides r n - 2 , and 
so on. We finally find that r n divides a and b , then r n must divide their GCD, d. But 
d divides rn. So r n = d. 

This technique for finding greatest common divisors is called the Euclidean al¬ 
gorithm. 

Sample Problem 9.3. Use the Euclidean algorithm to find the greatest common 
divisor of 864 and 291. 

Solution. 


864 = 2-291 +282, 

291 = 1- 282 + 9, 

282 = 31 -9 + 3, 

9 = 3-3, 

so gcd(862, 291) = 3. 

Practice Exercise. Use the Euclidean algorithm to find the greatest common 
divisor of 584 and 284. 

The idea of a greatest common divisor can be extended to sets of three or more 
numbers. To find the GCD of a, b, c, first find gcd (a, b ), then find the greatest com¬ 
mon divisor of this new number with c. It will be the required answer. 

Sample Problem 9.4. Use the Euclidean algorithm to find the greatest common 
divisor of 378, 336, and 490. 


Solution. 


378 = 1 • 336 + 42, 
336 = 8 - 42, 
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so gcd(378, 336) = 42. 


490 = 11-42 + 28, 

42 = 1 - 28 + 14, 

28 = 2 - 14, 

so gcd(378, 336, 490) = gcd(42, 490) = 14. 

Practice Exercise. Use the Euclidean algorithm to find the greatest common 
divisor of 144, 156, and 162. 


Reversing the Euclidean Algorithm 


The Euclidean algorithm can be reversed. Each equation of the form 

n -i = Wi +n+ 1 , 


can be inverted to give 


Starting from the equations 


n+i = n -1 qtn. 


r n — r n —2 qn— l r n— 1» 


we obtain 


Using 


we get 


r n -1 = r n -3 - q n - 2 r n - 2 , 


r n — r n —2 qn—l(Tn—3 qn—2 r n—l) 

= (1 + q n - 2 )r n -2 - Qn—l r n—3- 


r n —2 — Fn— 4 qn—2> r n—3i 


r n = (1 + qn-2){r n -A - q n -3r n -3) ~ qn-l(r n -3) 

= (1 + q n —2)^n—4 (1 H” qn —2 qn — \)(?n—?>)' 

The algebra of these equations is not important; the main point is that each time 
we express r n as a function of earlier terms. We eventually obtain 

gcd(a, b) = r n = z\a + zib, 

for some integers z\ and Z2- This is important enough to state as a theorem (you will 
see why in the next section). 

Theorem 59. The greatest common divisor of two positive integers a and b can be 
expressed as the sum of integer multiples of a and b. 
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This sort of expression is called an integer linear form in a and b , or an integer 
linear combination of a and b. 

Sample Problem 9.5. Express the greatest common divisor of 864 and 291 as an 
integer linear form in 864 and 291. 

Solution. From Sample Problem 9.3 we know that gcd(862, 291) = 3. Revers¬ 
ing the equations in that problem, 

3 = 282 - 31-9 
= 282 - 31 • (291 - 282) 

= 32-282- 31 -291 
= 32- (864-2-291) - 31 -291 
= 32-864- 95-291, 
which is the required form. 

Practice Exercise. Express the greatest common divisor of 584 and 284 as an 
integer linear form in 584 and 284. 

Exercises 9.1 

1. List all positive divisors of the numbers 63, 64, 288. 

2. List all positive divisors of the numbers 35, 128, 162. 

3. Which of the following numbers are divisible by 3: 10,873, 44,444, 123,456, 
51,804? 

4. Which of the following numbers are divisible by 9: 41,728, 62,901, 654,321, 
71,730? 

5. Which of the following numbers are prime: 195, 257, 1001, 419? 

6. List all primes less than 100. 

In Exercises 1 to 10, find the greatest common divisors of the pair of numbers by 
finding the prime factor decompositions of the numbers. 

7. 70, 120. 8. 168, 504. 

9. 180,600. 10. 260,455. 

In Exercises 11 to \%,find the greatest common divisors of the pair of numbers, using 
the Euclidean algorithm. Then express the GCD as an integer linear form in the two 
numbers. 
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11. 60, 84. 

13. 480, 1800. 
15. 120, 144. 
17. 210,861. 


12. 234,470. 
14. 84, 180. 
16. 292,244. 
18. 64, 160. 


19. Check your answers in Exercises 9.1.7 to 9.1.10 using the Euclidean algorithm. 
In Exercises 20 to 24, find the greatest common divisors of the set of numbers. 

20. 252, 308, 504. 21. 1386, 1170, 1890. 

22. 450,696,432. 23. 105,231,273. 

24. 595,910, 1155. 

25. We said that, when testing to see whether a is prime, it is not necessary to check 
whether it has a prime factor greater than *Ja. Why is this true? 


9.2 Modular Arithmetic 

Basic Definitions 

Suppose you look at a clock and want to know what time it will be five hours from 
now. If it is currently 10 o’clock, you do not say “15 o’clock,” but rather “3 o’clock.” 
You might say “a.m.” or “p.m.,” but that is extra information that is not on the clock- 
face. In the same way, 44 hours from now it will be 6 o’clock; what day, and whether 
it will be morning or evening, is not given by clockface arithmetic. 

What you do when interpreting a clock is to subtract multiples of 12 when nec¬ 
essary. In fact, all you are concerned with is the remainder after division by 12. For 
example, 10 + 44 = 54 = 4 x 12 + 6; we ignore the 12’s and say “6 o’clock.” 

Another way of expressing clockface arithmetic is to say calculations are carried 
out modulo 12 , and we might write 

44+10 = 6 (mod 12). 

Instead of an equality sign, we also write 44 + 10 = 6 (mod 12), and say the sum 
is congruent to 6. The number 12 is called the base or the modulus. It is common to 
express the answer to a modulo 12 problem as one of the numbers in the standard 
range 0, 1,2, ..., 11, although we could equally well write 12 instead of 0. (In the 
particular case of telling the time, we say “12 o’clock,” not “0 o’clock.”) 

Another method of telling time is the military one, or 24-hour clock. This in¬ 
dicates the difference between morning and evening, and is, in fact, a modulo 24 
arithmetic. (In the military clock we say “00:30 hours,” not “24:30 hours.”) 
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Binary arithmetic, the arithmetic of even and odd, is about remainders modulo 2. 
The addition is as follows: 

0 + 0 = 0, Even + Even = Even, 

0+1 = 1, Even + Odd = Odd, 

1+0 = 0, Odd + Even = Odd, 

1 + 1=0, Odd + Odd = Even. 

Binary arithmetic comes up in many electrical applications because there are often 
two states to be considered: “current flowing” or “no current.” As an example, think 
about turning a light on or off. On many lamps, there is a knob that you twist clock¬ 
wise. If the light is on, twisting the knob changes it to “off.” Similarly, twisting from 
“off” gives “on.” If 1 means “on” and 0 means “off,” and if the process of turning the 
knob is interpreted as “add 1” (which makes sense if you think of the lamp being off 
originally), we have 

0+1 = 1 , 1 + 1 = 0 , 

which is precisely arithmetic modulo 2. 

In the same way, one can talk about modular arithmetic modulo any positive 
integer (although “arithmetic modulo 1” is not very interesting: the answer to any 
integer calculation must be 0). The standard range for arithmetic modulo m is 
0, 1, ..., m — 1, and these numbers are often called residues modulo m. If we simply 
write a (mod m), we usually interpret this as the member of the standard range that 
is congruent to a modulo m (but we often write —1 for m — 1). Congruential equa¬ 
tions, where the role of the equals sign is taken by =, are usually called congruences. 

Sample Problem 9.6. Solve the congruence x 2 = 1 (mod 5). 

Solution. The easiest technique in this small case is to test all the possibilities in 
the standard range: there are only five of them. We find 0 2 = 0, l 2 = 1, 2 2 = 4, 
3 2 = 9 = 4, and 4 2 = 16 = 1. So the answer is “x = 1 or 4.” 

Practice Exercise. Solve the equation x 2 = 2 (mod 7). 

Sample Problem 9.7. Carry out the calculations 3 3 + 2 (mod 5), 3 x 13 — 7 
(mod 9). 

Solution. Modulo 5, 3 3 = 27 and 27 + 2 = 29, so the first answer is 4. In the 
second problem, 3x13 = 39 = 3 (mod 9), and 3 — 7 = —4 = 5 (mod 9). 

Practice Exercise. Carry out the calculations 5 2 + 2 fnod7), 4 x 14 — 9 
(mod 12). 

In the first example above, the process of getting rid of 5’s and coming down to 
a number in the standard range is called reducing modulo the base. 
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The relation v = y (mod m) is an equivalence relation on the set of integers. The 
equivalence class of v is 

[x] = [y : x = y (mod m)}, 

which is usually called the congruence class or residue class of x. 


Inverses 

The most obvious difference between modular arithmetic and ordinary arithmetic is 
the existence of multiplicative inverses. In the ordinary arithmetic of integers, only 
the units, 1 and —1, have inverses, and each is its own inverse. In arithmetic mod¬ 
ulo 7, for example, every nonzero residue has an inverse: 2x4 = 8 = 1, so 2 has 
inverse 4 and 4 has inverse 2; we write 2 _1 = 4 and 4 _1 =2. Similarly 3x5 = 1 
and 6 x 6 = 1, so 3 _1 = 5, 5 _1 = 3 and 6 _1 = 6 . And, of course, l -1 = 1. 

The situation is different in other moduli, and another significant change from 
ordinary arithmetic is seen. Consider the arithmetic modulo 12. There are several 
numbers with inverses: 1, of course, and 11 (which is —1); and 5x5 = 7x7=l, 
so both 5 and 7 are their own inverses. But 2, 3, 4, 6 , 8 , 9, and 10 have no inverses. 
In fact 2 x 6 = 0, a phenomenon that can never happen in ordinary arithmetic. We 
say two numbers are zero-divisors (mod m) if they are not zero modulo m but their 
product is zero. We shall soon show that a number cannot both be a zero-divisor and 
have an inverse. 

Sample Problem 9.8. Find all inverses and zero-divisors in the arithmetic mod¬ 
ulo 10. 

Solution. Of course l -1 = 1 and 9 _1 = 9. 2 x 5 = 10 = 0 and similarly 4x5, 
6x5 and 8 x 5 are all = 0, so 2,4, 5, 6 , and 8 are zero-divisors. Finally, 3x7 = 1, 
so 3 and 7 are mutual inverses. 

Practice Exercise. Find all inverses and zero-divisors in the arithmetic mod¬ 
ulo 6 . 

The key to inverses and zero-divisors is the greatest common divisor, as the fol¬ 
lowing theorem shows. 

Theorem 60. Suppose a is an integer not congruent to 0 (mod m). 

(i) If the GCD ( a , m) of a and m is 1, then a has an inverse (mod m). 

(ii) If(a,m) is not 1 , then a is a zero-divisor (mod m). 

Proof. From Theorem 59, there exist integers h and c such that (a, m) = ha + cm. 
Since any multiple of m is zero (mod m), (a,m) = ba (mod m). If (a, m) = 1, then 
b is an inverse for a. 
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Conversely, suppose (a, m) — d > 1. The GCD divides both a and m, so we can 
find integers v and y such that dx = a and dy = m. If v had any common factor e 
with m, greater than 1, then ed would be a common factor of a and m, greater than 
d , so this is impossible. We also know that y is not a multiple of m from the fact that 
dy = m and d > 1. Now 


ay = dxy = my = 0 (mod m), 

so a is a zero-divisor (mod m) (and for that matter so is y). □ 

This theorem shows that every integer not congruent to zero modulo m either has 
an inverse or is a zero-divisor (mod m), but cannot be both. 

Sample Problem 9.9. Which of the following are zero-divisors modulo 224 ? If 
the number is not a zero-divisor, find its inverse. 

16, 53, 63, 84, 97. 

Solution. Either by the Euclidean algorithm or by factoring, we find (16, 224) = 
16, (53, 224) = 1, (63, 224) = 7, (84, 224) = 28 and (97, 224) = 1. So 16, 63 
and 84 are zero-divisors. For 53 and 97, we look at the Euclidean algorithm 
calculations: 

224 = 4 x 53 + 12, 

53 = 4 x 12 + 5, 

12 = 2 x 5 + 2, 

5 = 2 x 2+ 1, 

1 = 5 — 2 x 2 = 5 — 2 x (12 — 2x5) 

= 5 x 5 - 2 x 12 = 5 x (53 - 4 x 12) - 2 x 12 

= 5 x 53 - 22 x 12 = 5 x 53 - 22 x (224 - 4 x 53) 

= 93 x 53 - 22,224, 

so 53 _1 = 93 modulo 224. 

224 = 2 x 97 + 30, 

97 = 3 x 30 + 7, 

30 = 4 x 7 + 2, 

7 = 3 x 2+ 1, 

1=7-3 x2 
= 13 x 7-3 x 30 
= 13 x 97 -42 x 30 
= 97 x 97 - 42 x 224 


= 7 - 3 x (30 - 4 x 7) 

= 13 x (97 - 3 x 30) - 3 x 30 
= 13 x 97 - 42 x (224 - 2 x 97) 


so 97 1 = 97 modulo 227. 
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Practice Exercise. Which of the following are zero-divisors mod 132? 

49, 55, 66, 91. 

If the number is not a zero-divisor, find its inverse. 

In the preceding sample problem, we also saw a case where an element other than 1 
or —1 can equal its own inverse—another departure from ordinary arithmetic. 


Simultaneous Congruences 

We now discuss finding all integers that simultaneously solve a set of congruences. 
To be more precise, suppose each of the positive integers mi, m 2 , ..., m n is rela¬ 
tively prime to each other (we say they are pairwise relatively prime) and we wish to 
find all integers v such that 


x = a\ (mod mi), 
x = a 2 (mod m 2 ), 

(9.2) 

x = a n (mod m n ). 

We write M = mi x m 2 x • • • x m n , and define positive integers M z , i = 1,2 , ... ,n, 
by Mi = M/mi (i n other words, M = M z m z ). Consider the n congruences 

Mix = 1 (mod mi). (9.3) 

In each case (Mi, m*) = 1, so there is exactly one solution to each congruence in 
(9.3). Suppose the solution to x = ai (mod mi) is x = Xi (mod mi). Now define 

X = M\x\a\ + M 2 ^ 2^2 + • • • + M n x n a n , 

and substitute X for v in x = ai (mod mi). Since Mj = 0 (mod mi) unless j = i, 
and M = O (mod mi), and is congruent to 1 (mod mi), 

X = a t (mod m,). 

This is true for every /, so X satisfies all the congruences in (9.2), and every integer 
congruent to X (mod M) is also a solution. 

But this class of integers is the only simultaneous solution of the set of congru¬ 
ences, for if Y also satisfies (9.2), then X = Y (mod mi) for each i, and since the 
mi are relatively prime in pairs, X = Y (mod M). Hence we have shown the follow¬ 
ing result (called the Chinese Remainder Theorem because the original form of the 
theorem was contained in a third-century AD book by Chinese mathematician Sun 
Tzu). 
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Theorem 61 (Chinese Remainder Theorem). If the n positive integers mi, m 2 , 
..., m n are relatively prime in pairs, the n congruences x = < 2 / (mod m/) /mve 
and only one simultaneous solution modulo M = m\ x m 2 x ••• x m n . 

Sample Problem 9.10. Find all integers that give the remainders 2, 6, and 5 
when divided hy 5, 7, and 11, respectively. 

Solution. In the notation used above, 

M = 385, Mi = 77, M 2 = 55, M 3 = 35. 
llx = 1 (mod 5), or 2x = 1 (mod 5), 

has the solution x = 3 (mod 5), 

55v = 1 (mod 7), or 6 x = 1 (mod 7), 

has the solution x = 6 (mod 7), 

35v = 1 (mod 11), or 2x = 1 (mod 11), 
has the solution x = 6 (mod 11 ). 

Hence, 

1 = 77 x3x2 + 55 x 6 x 6 + 35 x6x5 (mod 385), 
or X = 21 (mod 385). 


Practice Exercise. Find all integers that give the remainders 3, 2, and 4 when 
divided by 5, 7, and 11, respectively. 


Exercises 9.2 

In Exercises 1 to 10, carry out the calculations modulo 5. 


1. 3 + 5. 

2. 4 x 3. 

3. 3 4 - 1. 

4. 4- (4 + 3). 

(N 

1 

00 

id 

6. 2x3x4. 

7. 3 x (4 - 2). 

8. 2 4 + 3 4 . 

9. 14 + 2 2 . 

10. 5 - 4 4 . 

In Exercises ll to 20, carry out the calculations modulo 7. 

11. 2 + 5+1. 

12. 2x4. 
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13. 6 x (4 - 2). 

14. (2 + 3) 4 + 1. 

15. 5-8. 

16. 5x4x4. 

17. 1 + 2 + 2 2 + 2 3 . 

18. (2 + 3) x (4 + 6). 


19. 3 -(2 + 3 + 4). 20. 5 4 x 4 4 . 

In Exercises 21 to 30, carry out the calculations modulo 6. 


21. 7 x (5 + 2). 

22. 2 x (5 + 3) + 4. 

23. 3x4. 

24. (7 — 3) 3 . 

25. 9 x 7 x 5. 

26. 3-5. 

27. 3 x (3 2 + 1). 

28. 1 + 2 + 2 2 + 2 3 . 


29. (2 + 3) x (5 -2). 30. 5 x 4. 

In Exercises 31 to 40, carry out the calculations modulo 2. 


31. 2x5. 

32. 3 x (3 - 2). 

33. (6 + 3) 4 . 

34. 2-3 + 1. 

35. 5 x (3 - 4). 

36. (1 + 1 + l) 4 + 1. 

37. 1-1+2. 

38. 5x3x1. 


39. l-(4-2). 40. (2 + 3) 4 +l. 

41. Find all inverses and zero-divisors in the arithmetic modulo 14. 

42. Find all inverses and zero-divisors in the arithmetic modulo 15. 
In Exercises 43 to 52, carry out the calculations modulo 9. 

43.2 x 7. 44.3 x 5-*. 


45. (4 + 3)-*. 

46. 2-3 + 7. 

47. 13 + 12. 

48. (4 x 6) 4 . 

49. 1 + 2 + 2 2 + 2“*. 

50. 5-7. 


51. (2 + 5 + 4)-*. 52. (5 + 7) 4 . 

In Exercises 53 to 62, carry out the calculations modulo 12. 

53. (7 x 5)-*. 54. 3 x (3 -6). 
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55. 5 x 5-*. 


56. 5-9. 


57. II 3 . 


58. 5x3x7. 


59. 1 + 5 + 5 2 + 5 _1 . 60. (2+ 7)-(4-6). 

61. 7" 1 x (3 + 5 + 7). 62. 5 4 x 4 4 . 

In Exercises 63 to 66, use the Chinese remainder theorem to solve the set of congru¬ 
ences. 


x = 3 (mod 5), 
x = 4 (mod 7), 
x = 1 (mod 11) 


x = l (mod 3), 
x = 2 (mod 5), 
x = \ (mod 7). 


x = \ (mod 5), 
x = 5 (mod 7), 
x = 2 (mod 13) 


x = l (mod 3), 
x = 4 (mod 7), 
x = 4 (mod 11) 


67. Prove that the arithmetic of the Boolean algebra S 2 , defined in Section 3.1, is 
not the same as arithmetic modulo 2. 


68. Verify that x = y (mod m) is an equivalence relation on the set of integers. 


9.3 An Introduction to Cryptography 

Secret Writing 

The idea of encoding data arises in many ways. Words are encoded as strings of 
binary digits whenever you use a word processor so that a computer can work with 
them. Another important use of encoding is to compress data; this is often done when 
data are sent over telephone lines, and in fact, we use data compression whenever we 
write an abbreviation. In both these examples, it is easy to decode the information. 

In the rest of this chapter we look at a different reason for encoding—secrecy. 
Sometimes you encode something and tell only certain selected people how to de¬ 
code it. This form of secret encoding is called encryption , and the study of it is 
cryptography. It has been widely used in military operations, but nowadays it is very 
common in commerce. For example, the use of a PIN number with your bank card is 
a form of encryption. 

The original message, in everyday language, is often called a plaintext , and 
the encoded version is the corresponding ciphertext. (“Cipher” is another word for 
“code.”) The process of moving from ciphertext to plaintext (the opposite of encryp¬ 
tion) is called decryption. The special information required by your ally to decrypt 
your message is called a key. 
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Suppose you send a secret message. If your enemy intercepts it, he will try to fig¬ 
ure out what it means by working out how you did the encoding—usually, he tries to 
find the key. This is called cryptanalysis , or codebreaking. The study of cryptography 
and cryptanalysis is collectively called cryptology. 

There are many mystery stories in which cryptography plays an important part. 
If you want to read some of them, I recommend The Adventure of the Dancing Men , 
a Sherlock Holmes story by Sir Arthur Conan Doyle, Edgar Allen Poe’s The Gold 
Bug , or The Nine Tailors by Dorothy L. Sayers. 


Physical Secrecy 

Some 2500 years ago the Spartans during their wars used a special device to transport 
their secret messages. They took a staff called a scytale (pronounced “sitterlee”), and 
wound a long, thin strip of parchment around it at an angle. They then wrote their 
message on the parchment, one letter on each width of parchment. When the strip is 
unwound, the letters appear to be completely random (try it yourself). But when the 
roll is wound again around another staff of equal diameter to the original staff, the 
message makes sense, but nonsense appears if you use the wrong size staff. 

As a very simple example, suppose you wanted to send the message “help is 
coming.” You would wrap a piece of parchment around your staff and write your 
message on it in a few lines. Suppose you decided to write it in three lines: the first 
line would be “HELP,” the second “ISCO,” and the third “MING” (no need to include 
spaces—in fact, if you include spaces between words, it might help your enemy to 
decode your message.) We will call the number of lines, three in this case, the cir¬ 
cumference of the scytale. You end up with something like Figure 9.1. The strip will 
actually say “HIMESILCNPOG,” but when your allies wrap it around their own staff 
(the same diameter as yours, so that the scytale comes out to have circumference 3) 
they get the welcome news “HELPISCOMING.” 

However, suppose your enemy intercepts the message. If she uses a staff of the 
same diameter, you are in trouble; but if a slightly larger staff is used, so that the 
circumference is 4, the enemy will see the picture in Figure 9.2, and try to interpret 
“HSNIIPMLOECG” 
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Nowadays this would not be a very secret method of writing, as it can be easily 
deciphered by moving around the edges of the roll; but very few people knew how 
to read at the time, and this ensured the secrecy of the scytale method. If your enemy 
could read, but did not know about the scytale method, the strip would probably be 
regarded as nonsense, or else as some strange language. 

Sample Problem 9.11. Interpret the scytale message 

MARSSEAVLESENEISANYLVEGDPOECEEES 

SRSCOTBEHOPTETADLHCMVEEEAEBDHIU. 

Solution. To decode this we try various circumferences. If we try circumference 
5 (and rearrange our message in five rows) we get complete nonsense, as follows: 

MESSVOECHTCEH 
AAEAEESOO AM A I 
RVNNGCSTPDVEU 
SLEYDERBTLE B 
SEILPESEEHED. 

However, if we arrange it in six rows, the message becomes clear: 

MANYPEOPLEH 
AVELOSTTHE I 
RLIVESBECAU 
SESECRETME 
S SAGESHAV B 
EENDECODED. 
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Practice Exercise. Interpret the scytale message 

TBSTNHAAHTERREGBIEFAAAARTRNTOE. 

Other physical cryptography techniques were used. Histiacus, a Greek ambas¬ 
sador in Persia, sent secret information home to his country by shaving a slave’s 
head, branding a message on his skull and letting the hair grow again. He then sent 
him to Greece. Others used to drug their slaves and brand messages on their backs. 
The slaves knew nothing about the important messages they were carrying. In both 
cases, it was unlikely that the slave would live very long after the message was de¬ 
livered. 


The Caesar Cipher 

An early user of cryptographic techniques was the famous Roman commander-in- 
chief and statesman Julius Caesar (100-44 B.C.). In the second century A.D. Sueto¬ 
nius ( The Twelve Caesars I, 56) describes 

his private letters to friends, the more confidential passages of which he 
wrote in cipher: to understand their apparently incomprehensible meaning 
one must number the letters of the alphabet from 1 to 22, and then replace 
each of the letters that Caesar has used with the one which occurs three 
numbers lower—for instance, D stands for A. 

As you can see, the Romans used only 22 letters in their alphabet. But, for conve¬ 
nience, let’s assume Caesar used the English alphabet and used the same alphabetical 
order as we do. We obtain Caesar’s cipher if we write the ciphertext alphabet beneath 
the plaintext alphabet, shifted three positions to the left. Notice that the alphabet 
wraps around at the end. 


plaintext: a b c ... w x y z, 
ciphertext: D E F ... Z A B C. 

We encipher a plaintext letter by replacing it by the letter beneath it. For instance, 
the message the plaintext becomes WKH SODLQWHAW. (Caesar did not use the trick 
of omitting blanks.) Deciphering is equally easy; we replace a ciphertext letter by the 
plaintext letter above it. So OHWWHU is deciphered to the message letter 

Sample Problem 9.12. What is the Caesar cipher for the house by the stream? 

Solution. WKH KRXVH EB WKH VWUHDP. 


Practice Exercise. What is the Caesar cipher for enter the maze? 
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Caesar chose to go forward three letters for no particular reason; one can, of 
course, get the ciphertext alphabet by shifting any number of positions. Since the 
English alphabet consists of 26 letters, we obtain exactly 26 such ciphers; they are 
called additive ciphers. The key is the number of places you shift; Caesar’s key 
was 3. People sometimes denote the key by the corresponding letter of the alphabet, 
the letter to which A is transformed, and in that terminology we would say Caesar’s 
key is D. The additive cipher with key 0 (or A) is no good—the ciphertext is exactly 
the same as the plaintext—so there are 25 useful additive ciphers. 

Sample Problem 9.13. The phrase RFS TW BTRFS was produced by an addi¬ 
tive cipher. All the words were regular English words. What was the plaintext? 

Solution. This is very easy because the spaces have not been removed. To save 
work, we attack the shortest word. To decode TW we write out two copies of the 
alphabet, one starting with T and the other with W, as shown. 

TUVWXYZABCDEFGHIJKLMNOPQRS 
WXYZABCDEFGHIJKLMNOPQRSTUV 

TW must have come from some pair of letters that lie one over the other. The only 
English words that appear are be , lo , and or\ one might also accept ad , even 
though it is an abbreviation. If the original was be , then to get TW the key must 
have been 18, and the plaintext was zna be jbzna , which is nonsense. If it was lo , 
the key was 8 and the plaintext was jxk lo tljxk. If it was ad , the key was 19 and 
the plaintext was ymz ad iaymz . If it was or , the key was 5 and the plaintext was 
man or woman. Only the last makes sense, so it was the plaintext. 

Practice Exercise. The phrase JGTG CPF PQY was produced by an additive 
cipher. All the words were regular English words. What was the plaintext? 

In the movie 2001: A Space Odyssey , the computer is called HAL; this can be 
obtained by moving the letters IBM one step to the right (or 25 steps to the left). The 
producers claimed that this was a coincidence. 


The Vigenere Method 

Vigenere invented an encryption method that combines several additive ciphers. In 
modern terminology we can say that the key is a string of numbers, each one between 
0 and 25. The easiest way to remember this is to convert it into a word, using A = 0, 
B = 1, .... For example, the key 1, 17, 14, 22, 13 is equivalent to the keyword 
BROWN. Then the plaintext is written out, with the keyword underneath it, repeated 
as necessary. For example, to encrypt the message the computer has become very 
useful we write 

THECOMPUTERHASBECOMEVERYUSEFUL 

BROWNBROWNBROWNBROWNBROWNBROWN. 
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Now each letter is encrypted using the additive cipher whose key is below it. In the 
example, T would become U , H would become Y, and so on. The message becomes 

UYS YBNGIPRS YOO OFTCIR WVFU HTVTQY. 

In fact, the ciphertext would not usually be broken into words. You would probably 
receive 

UYSYBNGIPRSYOOOFTCIRWVFUHTVTQY, 

or perhaps it would arrive arbitrarily broken into four-letter or five-letter groups to 
make copying errors less likely: 

UYSY BNGI PRSY OOOF TCIR WVFU HTVT QYZZ, 

where the last two letters have been added simply to make the last set of letters the 
right size. (Presumably the recipient of a message will have no trouble ignoring a 
couple of nonsense letters at the end.) 

Exercises 9.3 

1. What is the difference in meaning between the words encode and encryptl 
In Exercises 2 to 9, decrypt the following scytale message. 

2. PSNRMLEDEIESMAEAEORS. 

3. WALOANLFKDTFESHEUMEEPEC. 

4. WRLHSEETRNASRCWOIEMTREYEWIKATISSNFRGIUS. 

5. HRSOETWMWMIEALMNEAYSRMMCOUH. 

6 . TNIOHEHSNFOTEWVTDWSEEHORPRN. 
IFIAETSSTRTOMEITHRECNAESTRG. 

7. ISDIGBTATNOEILHPOISLAODNAYTSFWTAASOAR 
CSERNUKISTTTNNSUOHOGINFUWLOEANLENMWIE 
MOUIVDAF S F EGNATERE. 

8. CTNITOHTNHDEFTEEMOEWBORLORSMLRETOIL 
AIFGDKMSETIPENONOCCDGRREA 
ITEIYSATN. 

9. LGOABIAIRFRENPKEHSEGHEAIENIIMTS 
ESNCAGTMACSNEOSDRUYNRTLYBOEYOY 
PTFRCHLTLTAAAAOEHLEVCGTESSEKRY. 

10. In the scytale method, what is the keyl 
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(A) The staff. (B) The diameter of the staff. 

(C) The strip of parchment. (D) The plaintext. 

In Exercises 11 to 14, how would Caesar encode the message (if he spoke English)? 

11. Send troops. 12. Birthday greetings. 

13. The end is near. 14. All sales are final. 

In Exercises 15 to 18, assume Caesar spoke English. What command was he send- 
ing? 

15. DWWDFN. 16. PRYH QRUWK. 

17. UHWUHDW. 18. EULQJ JROG. 

In Exercises 19 to 22, encrypt the following messages using an additive cipher with 
key 7. 

19. The moon has risen. 20. Sell all your gold stocks. 

21. Seven knights are approaching. 22. The rent is due. 

In Exercises 23 to 26, decrypt the following messages using an additive cipher with 
key 5. 

23. IT STY UFXX LT. 24. MJWJ YTIFD LTSJ YTRTWWTB. 

25. GWJFI FSI HNWHZXJX. 26. F HMNHPJS NS JAJWD UTY. 

In Exercises 27 to 30, the message was encoded with an additive cipher. Assuming 
they consist of ordinary English words, find the key and the plaintext. 

27. NBLF MPWF OPU XBS. 28. E XEPI SJ XAS GMXMIW. 

29. AMVL QV BPM KTWEVA. 30. VLPSOH SOHDVXUHV. 

In Exercises 31 to 34, use the Vigenere method to encrypt the plaintext Mary had a 
little lamb with the given keyword. 


31. FERRY. 
33. CUP. 


32. PREVIEW. 
34. BANANA. 
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9.4 Substitution Ciphers 

Generating Substitution Ciphers 

In an additive cipher, each letter is changed into another fixed letter. For example, in 
the Caesar cipher, every A becomes a D, every B becomes an E, and so on. To gen¬ 
eralize this, we make a fixed substitution for each letter, but use a more complicated 
rule than simple addition. You can write out the complete alphabet, and the list of 
substitutions, such as 

plaintext: abcdefghi j k l m, 

ciphertext: DAFBQKMCZYEGH, 
plaintext: n o p q r s t u v w x y z, 

ciphertext: WXPRS I TLJNVUO. 

In this code the message an elephant becomes DWQGQPCDWT. (Spaces and punc¬ 
tuation are usually ignored.) There are arithmetical ways to break codes such as this, 
based on the frequency of letters. For example, E is the most common letter in En¬ 
glish, so if you think your message is written in English, try putting E for the most 
commonly occurring letter. This method only works for reasonably long messages; 
for short ones, you need the key (the whole substitution table). 


Substitution Ciphers with a Keyword 

One simplification of the substitution method involves the use of a keyword. The 
second row of the table is constructed by writing first the keyword and then the rest 
of the alphabet in order. The keyword should not be too short, it should contain no 
repeated letters, and preferably at least one of the letters should be rather late in the 
alphabet. For example, keyword CRAZY produces the table 

plaintext: abcdefghi j k l m, 

ciphertext: CRAZYBDEFGH I J, 
plaintext: nopqrstuvwxyz , 
ciphertext: KLMNOPQSTUVWX. 

Sample Problem 9.14. Which of the following would be a good keyword for a 
substitution code? UP, COPPER, RAYON, FACED. 

Solution. UP is too short. COPPER contains a repeated letter. RAYON is good. 
FACED is too close to the beginning of the alphabet; most letters would be un¬ 
changed. 

Practice Exercise. Which of the following would be a good keyword for a sub¬ 
stitution code? BIRTHDAY, APPLE, TEA, BEAD. 
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This method has the advantage that the encryption method is easy to remember, 
provided you choose a memorable keyword. However, long words with no repeated 
letters are not common. So a reduced keyword (or reduced key phrase ) is used. Every 
time a letter is repeated, it is deleted from the key. This allows longer words and 
phrases to be used. For example, Americans have no trouble remembering the name 
George Washington ; it gives the reduced key GEORWASHINT and the table 

plaintext: abed efghi j k l m, 

ciphertext: GEORWASH I NTBC, 
plaintext: n o p q r stuvwxyz , 

ciphertext: DF J KLMPQUVXYZ. 


Letter Frequencies 


Here is a table of the frequency of letters in everyday English. Out of every thousand 
letters, the expected number of times a letter will appear is 


A 

72 

B 

11 

C 

33 

D 

40 

E 

126 

F 

30 

G 

18 

H 

33 

I 

76 

J 

2 

K 

3 

L 

36 

M 

25 

N 

76 

0 

74 

P 

27 

Q 

3 

R 

83 

S 

58 

T 

90 

u 

30 

V 

13 

w 

14 

X 

5 



Y 

21 

z 

1, 




so the most common letters are E, followed by T, then R, and then I, N, O, and A are 
of approximately equal frequency. 

Of course, this is experimental data from extensive counting, and another long 
count could give slightly different frequencies. Also, the frequencies will vary when 
different sorts of prose are considered. As an example, the relative frequency of let¬ 
ters in Lincoln’s Gettysburg Address per thousand letters is 


A 

88 

B 12 

C 27 

D 

50 

E 

144 

F 23 

G 24 

H 

70 

I 

59 

J 0 

K 3 

L 

36 

M 

11 

N 67 

0 81 

P 

14 

Q 

1 

R 69 

S 38 

T 

109 

u 

19 

V 21 

W 24 

X 

0 



Y 9 

Z 0. 




An Example 


We give a detailed example of how frequency data are used. Consider the ciphertext 
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ZJR KRBRYMT DYQRYRQ ZJR WRPDBQ 
NYIKMQR ZD MZZMPO MZ ZJYRR 
ZJIYZG MU ZJR ZJIYQ NYIKMQR 
DBR JDAY TMZRY DB ZJR TRSZ 
ZJR SDAYZJ ZD ORRH IB YRWRYCR. 

In this example, and in the exercises for this section, we include spaces (but not 
punctuation), to make the decryption a little easier. But this would not usually happen 
in the real world. 

First we calculate the letter frequencies. There are 116 letters, and the frequencies 
are 


A 2 

B 5 

E 0 

F 0 

I 5 

J 10 

M 8 

N 2 

Q 6 

R 22 

U 1 

V 0 

Y 13 


Cl D 8 

G 1 HI 

K 3 L 0 

0 2 P 2 

S 2 T 3 

W 2 X 0 

Z 17. 


The two most common letters are R and Z, so we first guess that R stands for e 
and Z stands for t. Making these substitutions, we obtain 

tJe KeBeYMT DYQeYeQ tJe WePDBQ 
NYIKMQe tD MttMPO Mt tJYee 
tJIYtG MU tJe tJIYQ NYIKMQe 
DBe JDAY TMteY DB tJe TeSt 
tJe SDAYtJ tD OeeH IB YeWeYCe 

(our substitutions are shown as lower case letters). The most commonly occurring 
two-letter combination in English is th and RJ occurs eight times in the message. If 
R represents t then probably J stands for h. So 

the KeBeYMT DYQeYeQ the WePDBQ 
NYIKMQe tD MttMPO Mt thYee 
thIYtG MU the thIYQ NYIKMQe 
DBe hDAY TMteY DB the TeSt 
the SDAYth tD OeeH IB YeWeYCe. 

Y occurs 13 times, and if we substitute r for Y we get the word three in the 
message; it is very hard to imagine any other translation of thYee. So 
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the KeBerMT DrQereQ the WePDBQ 
NrlKMQe tD MttMPO Mt three 
thlrtG MU the thlrQ NrlKMQe 
DBe hDAr TMter DB the TeSt 
the SDArth tD OeeH IB reWerCe. 

The third word may well be ordered. So replace D by o and Q by d. This 
makes sense because D occurred eight times, a relatively high frequency, and o is a 
common letter. The message becomes 

the KeBerMT ordered the WePoBd 
NrlKMde to MttMPO Mt three 
thlrtG MU the third NrlKMde 
oBe hoAr TMter oB the TeSt 
the SoArth to OeeH IB reWerCe. 

It looks as though thlrtG might be the word thirty. So we make those sub¬ 
stitutions 


the KeBerMT ordered the WePoBd 
NriKMde to MttMPO Mt three 
thirty MU the third NriKMde 
oBe hoAr TMter oB the TeSt 
the SoArth to OeeH iB reWerCe. 

Now replace B by n and M by a to form the small words on, one, and at. (We 
choose on rather than of because one is a word but of e is not; neither in nor it 
were available because i has been used already.) 

the KeneraT ordered the WePond 
NriKade to attaPO at three 
thirty aU the third NriKade 
one hoAr Tater on the TeSt 
the SoArth to OeeH in reWerCe. 

to make various recognizable words, we replace K by g, T by 1, W by s, and P by c. 
Then N becomes b and 0 becomes k 

the general ordered the second 
brigade to attack at three 
thirty aU the third brigade 
one hoAr later on the leSt 
the SoArth to keeH in reserCe. 

The final substitutions are v for C, m for U, u for A, f for S, and p for H. The 
plaintext is 
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the general ordered the second 
brigade to attack at three 
thirty am the third brigade 
one hour later on the left 
the fourth to keep in reserve, 


and the encryption table was 

plaintext: abode f g h i j k l m, 

ciphertext: MNPQRSKJI O T U, 

plaintext: n o p q r s t u v w x y z, 

ciphertext: B D H Y W Z AC G. 

A few letters are not filled in because they were never used. 

Exercises 9.4 

In Exercises 1 to 4, use a substitution cipher to encrypt the message “encipher is 
another word for encrypt”, with the given keyword. 

1. DROVE. 2. BRAZEN. 

3. EXTRA. 4. BOTANIC. 

In Exercises 5 to 8, use the reduced keyphrase method to encrypt the message “enci¬ 
pher is another word for encrypt,” with the given keyphrase. 

5. GIANT LIZARD. 6. EXTRATERRESTRIAL. 


7. MOTHER'S WORRY. 8. DIXIELAND JAZZ. 

9. A substitution cipher produces the ciphertext KLKEVQOLLMPAOOFTER . As¬ 
suming the message was in simple English, what was the keyword? 

(i) PROW. (iii) BOXCAR. 

(ii) AZURE. (iv) OXEN. 

In Exercises 10 to 17, decide whether the given word would be a good keyword for a 
substitution code. 

10. POTATO. 11. AORTA. 

12. IMAGINE. 13. DAYLONG. 

14. AZIMUTH. 15. PIE. 


16. DAZZLE. 17. LAZY. 

18. Encrypt the following messages using the reduced key phrase with key seven 
lazy days. 
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(i) It is common to illustrate sets and operations on sets by diagrams. 

(ii) Find a Boolean expression corresponding to the following circuit. 

(iii) Three boys and four girls are to sit along a bench. 

(iv) Prove that any two diagonal matrices commute. 

In Exercises 19 to 22, repeat Exercise 18 using the given key phrase. 

19. Merry Christmas. 

20. Feet of clay. 

21. Fox and geese. 

22. Dazed and confused. 

23. In the following sentence from a book on the history of mathematics, the most 
common letter is not a vowel. Decipher the quotation 

PWY QPCBN CZ ZYBDXPQ JXQP PWYCBYD AQ AVYIPBATXUJN 
JAVHYO FAPW PWY WAQPCBN CZ DXPWYDXPATQ. 

24. ONBBCSNNE AI T ZBTHQ SOQXQ ZQNZBQ UXNW ANST 
WAIPTLQ QTHO NPOQX UNX WNRAQ IPTXI. 

The above show business quotation has been enciphered using a substitution 
where P was enciphered as Z. To save you counting, letter frequencies are 


A 4 

B 4 

C 1 

D 0 

E 1 

F 0 

G 0 

H 2 

/ 4 

J 0 

K 0 

L 1 

M 0 

N 9 

0 4 

P 3 

Q 9 

R 1 

S 3 

T 6 

U 2 

V 0 

W 3 

X 5 


Y 0 

Z 3. 



What is the plaintext? 

25. Decrypt CDKLI QLT KLRDBIL ABI JFM HM HV MPL ECFMP 
APC QFVM GHZPM BTK KHL. 

26. Decrypt ABC DEFACG HAIACH JFKK ECMCL NC I OFMFKFPCG 
OQDEALR DEAFK JC HSCEG TQLC VQL NQQWH ABIE JC GQ 
QE OBCJFEX XDT. 

27. Decrypt EKD AZEFLFHE YDWFDMDH DMDNOEKFCP KD NDBTH AC 
EKD QBSXDE AI B CDG YAAX. 
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9.5 Modern Cryptography 

The RSA System 

In the classic cryptographic systems, knowing the method of encryption allows you 
to decrypt a coded message. Modem developments have centered on cases where 
this is not true—even when the encryption method is known, decryption takes a 
prohibitively long time. 

Several cryptographic systems with this property have been developed. For our 
purposes, it is sufficient to discuss one of these. We shall examine a system that was 
invented by Ronald L. Rivest, Avi Shamir, and Leonard M. Adelman. It is usually 
called RSA. 

We shall describe their system by giving a simple example. Your friend, the per¬ 
son to whom you wish to send a secret message, must choose two prime numbers— 
let us call them p and q. The least common multiple of p — 1 and q — 1—the smallest 
positive integer such that both p — 1 and q — 1 divides it—will also be important, 
so we will give it a name; call it m. Your friend then chooses an integer r, greater 
than 1, which is relatively prime to m (so no proper divisor of r can also be a divisor 
of p — lovq — l). Your friend also has to find a number s such that r-s = 1 (mod m). 
(This can be done using the Euclidean algorithm.) 

For the purposes of our example, let us suppose your friend chose p = 7 and 
q = 17, so that p — 1 = 6, q — 1 = 16, and m = 48. Then r can have no proper 
divisor that also divides 6 or 16. The numbers 2, 3, 4, and 6 are impossible, but 5 or 7 
are allowable, so let’s say she chose 5. The number m must satisfy 5-s = 1 (mod 48). 

So 5 -s must be one of the numbers 1,49, 97, 145, 193,_As 5-s must be divisible 

by 5, the product is 145, and s = 29. 

Say you wish to send the message CAP. You first convert the message to numbers 
by replacing by 1, B by 2, ..., and Z by 26. The plaintext CAP becomes 3116. To 
convert a number from plaintext to ciphertext, raise it to the power r and then reduce 
the answer modulo pq. Using r = 5, pq = 7-17 = 119: 

3 5 = 243 = 5 (mod 119), so 3 is encoded 5; 
l 5 = 1 = 1 (mod 119), so 1 is encoded 1; 

16 5 = 67 (mod 119), so 16 is encoded 67. (We will show how this was calculated 
in a moment.) 

Therefore the ciphertext for CAP is 5167. 

When your friend receives the message 5167, she raises each number to the 
power s , which is 29 in this case. The number 5 29 is very large—bigger than 70 
million million million—but it is easily shown to be 3 modulo 119 (see the next sub¬ 
section). So 5 is decoded as 3. Similarly 1 decodes to 1 and 67 to 16, and CAP is 
recovered. 
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Formally, we would say that encryption is carried out by converting the mes¬ 
sage to a sequence of numbers and applying the encryption rule x i-> x r (mod pq)\ 
sometimes one refers to the encoding function E{x) = x r (mod pq ), or calls r the 
encryption exponent. Similarly, decryption is carried out by applying the decoding 
function D(x) = v lS ’ (mod pq) and converting back from numbers to a message. 

This method is not restricted to the 26 letters of the alphabet, represented by the 
numbers 1 to 26. The only restriction is that the number of symbols must be smaller 
than the modulus pq , or else there would not be enough residues to represent all 
the symbols. But this is no problem. In the real world, RSA encryption is used with 
moduli that run to hundreds of digits. 

In practice, you would not transform CAP into the sequence 3116. Instead you 
would run the numbers together, putting in a 0 so that A is written as 01 (every let¬ 
ter becomes two digits, or else you could not tell whether 12 represented AB or L), 
and encrypt the number 30,116 by a single application of the encryption function. 
In general, one could make the whole message into one big number (either ignor¬ 
ing spaces between words, or putting 27 for a space), break it up into a sequence 
of smaller numbers—10-digit or 50-digit numbers perhaps—and encrypt these num¬ 
bers. As stated above, the only restriction is that the smaller numbers, the ones that 
are individually encrypted, must be smaller than the modulus m to avoid ambiguity. 

Sample Problem 9.15. Suppose messages are constructed by substituting 01 for 

A, 02 for B, ..., and then running the digits together. What numbers represent 

ZENITH and FOREST? 

Solution. ZENITH is 260514092008; FOREST is 61518051920. 

Practice Exercise. What number represents ALPHABET? 


Binary Expressions and Powers 

We chose CAP to show you that even in a very small example, big numbers arise. To 
calculate 16 5 on a calculator, we find 16 2 -16 2 -16. Now 16 2 = 256, and by subtracting 
119 (twice) we find 256 = 18. Then 18-256 = 18-18 = 324 = 86. (Or you could 

say 18-256 = 4608. To reduce mod 119, first divide and find 4608/119 = 38.72_ 

Subtract 119 from 4608 thirty-eight times (you actually calculate 4608 — (38-119)) 
and get 86. Then 86-16 = 1376 = 67.) 

This is an example of using binary representations to calculate large powers of 
numbers. Suppose r = ( a n a n -\ ... < 21 ^ 0 ) 2 , or 

r = 2 n a n + 2f ^a n —\ 4- • • • 4- 2 a\ -\- ao. 


Then any number b satisfies 
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b r — b 2Han x b 2n lan ~ l x • • • x b 2ai x b a °. 

Now each at is either 1 or 0, so we have expressed b as a product of some of the 
terms 

b 2 \ b 2 "- 1 , ...,b 2 ,b. 

Calculating these powers of b is relatively easy: start from b , square it, square the 
result, and so on. 

When the calculations are being carried out modulo n , the arithmetic is made 
even easier by the fact that you can reduce modulo n at each step. 

Sample Problem 9.16. Calculate 4 19 and 4 19 (mod 82). 

Solution. 4 1 = 4, 4 2 = 16, 4 4 = 16 2 = 256, 4 8 = 256 2 = 65536, 4 16 = 
65536 2 = 4294967296, so 

4 19 = 4 x 16 x 4294967296 = 274877906944. 

One could reduce this answer modulo 82, but it is easier to proceed as follows: 
4 1 = 4, 4 2 = 16, 4 4 = 16 2 = 256 = 10 (mod 82), 4 8 = 10 2 = 100 = 18, 
4 16 = 18 2 = 324 = -4, so 

4 19 = 4 x 16 x (-4) = -256 = 72 (mod 82). 

Practice Exercise. Calculate 3 13 and 3 13 (mod 19). 

Notice that the calculation of 4 19 took six multiplications. Multiplying it out 
directly would take 18 multiplications. Of course, computer programs are available 
that do these calculations automatically, and for very large numbers the saving in 
time is significant. 

We now perform the calculation that was required in the RSA example. 

Sample Problem 9.17. Find 5 29 (mod 119). 

Solution. We use the decomposition 29 = 16 + 8 + 4 + 1. We calculate 5 2 = 25, 
5 4 = 625 = 30 (mod 119), 5 8 = (5 4 ) 2 = 30 2 = 67 (mod 119), 5 16 = 67 2 = 
86 (mod 119), and 


= 5 16 

• 5 8 

• 5 4 

•5 (mod 119) 

= 86 

67- 

30 

•5 (mod 119) 

= 86 

67- 

150 (mod 119) 

= 86 

67- 

31 

(mod 119) 

= 86 

2077 

(mod 119) 

= 86 

54 

(mod 119) 

= 4644 = 

3 

(mod 119). 


Practice Exercise. Find7 29 (mod 53). 
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Public Key Cryptosystems 

RSA can be used to construct a public key cryptosystem. Suppose several different 
people wish to send secret messages to the same recipient. One example is a bro¬ 
kerage house, where the customers want to send their brokers orders to buy or sell. 
Another is military intelligence, where different operatives all report to the same 
central office. More generally, we can think of an electronic mail service, in which 
anybody might wish to send a message to any member of the service. We refer to 
those who can send and receive messages as members of the system. There may also 
be other users who receive, but do not send, messages. 

Nowadays we must assume that any outsider can read any computer system; 
hackers have broken into even highly secret government and military computers. So 
how can we keep outsiders from understanding our messages, even though they can 
read the messages? The answer must lie in encryption. 

Here is one method, using the RSA scheme. The recipient can publish the key— 
the product pq and the power r. This is enough information for a member of the 
public to encrypt a message. However, when the ciphertext is received, the least 
common multiple of p — 1 and <7 — 1 is needed for decryption. In theory, this can 
be calculated if you know pq ; you just break this number down into its factors p 
and q, and then you know p — 1 and q — 1. For example, it takes only a fraction 
of a second for a computer to find that 119 = 7-17. But breaking a large number 
down into factors is a very hard computation. It is possible to choose primes p and 
q so large that, if you are given only their product, it would take a computer many 
years to find p and q . Your enemy does not have years to spend, so only the proper 
recipient can decrypt the ciphertext. 

Other methods have also been developed, and RSA is just one example. But 
all the systems work in this way. Every possible recipient has an encryption method. 
This is “published:” they might actually publish the key in a directory, or users might 
be given access to a computer program that encrypts the message when the recipi¬ 
ent is specified, and then delivers it. In any event, member A somehow provides a 
(public) function, call it e&, that encrypts messages. Then A, and only A, has access 
to a (private) decryption function, cIa say, that decrypts. If the plaintext message m 
is delivered to the system, then A receives the message ekirn), and then calculates 
Ja(£a(^)), which equals m. 

Suppose you subscribe to an electronic news service. In this case, there is only 
one member of the system, and all other users only need to decrypt messages. The 
system will select an encryption function for you and send details of the decryption 
method to your computer. These details will again be encrypted, so that all details of 
the system are kept secret by the news provider. 
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Security of RSA 

The most important feature of any public key cryptosystem is that, even though you 
know the function e&, it must be an extremely long computational task to calculate 
the function cIa . 

In discussing any computational problem, it is important to know how long it 
takes to perform the computation. Usually larger problems take more time. For ex¬ 
ample, we saw in Section 3.4 that a circuit to add two binary digits needs four gates, 
while our circuit to add three binary digits needed nine gates. If your computer took 
the same amount of time to utilize any gate, increasing the size of the problem from 
two to three (the number of binary digits to be added) will multiply the computation 
time by |. 

Suppose stating a computational problem requires inputting n binary digits to a 
computer. The number of steps required to carry out the computation will depend on 
the size of n. Say the solution requires f(n) steps. The function f(n) is called the 
complexity of the algorithm. Sometimes one particular instance of a problem can be 
solved more quickly than another, so f(n) will be the largest solution-time for size 
n problems (the worst case). 

If f(n) is a polynomial function of n (or smaller), we say the algorithm is poly¬ 
nomial. If no polynomial algorithm is known for solving a problem, it follows that 
increasing the size of the problem will cause a very great increase in the time needed 
to solve it. 1 Some problems, such as finding the greatest common divisor of two 
integers or testing whether a graph is Eulerian, have polynomial algorithms. There 
are many other problems, such as the Traveling Salesman problem, for which no 
polynomial algorithm is known. 

Suppose you are given the public information, m and r, for an RSA system. To 
decipher messages, you need the number s. To calculate s you need to know not only 
r but also (p — 1 )(q — 1), and to calculate (p — 1 )(q — 1) you need to know the 
factors p and q , that is, you need to decompose m into its prime factors. But no poly¬ 
nomial algorithm is known for factoring integers, and many people believe that no 
such algorithm exists. The difficulty of factoring becomes much greater as the prime 
factors become larger. So we can make an RSA system safe against codebreakers 
by making the modulus a product of two factors, and making those factors large. 
Current suggestions are that p and q should be at least 100 decimal digits in length. 

Of course, it is always possible that somebody will discover a polynomial algo¬ 
rithm to factor integers, and that would make RSA insecure. Many mathematicians 
believe this to be theoretically impossible, but no proof is known. 


1 This informal discussion of complexity is very superficial and is only intended to give you a 
rough idea of why RSA is secure. The interested reader should consult books on cryptography 
or computational theory. 
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Exercises 9.5 

1. Consider an RSA scheme with p — 5 and q = 11. 

(i) What are the possible r values if r < 10? 

(ii) If r = 3 is chosen, use the system to encrypt the plaintext “23.” 

2. Consider an RS A scheme with p = 7 and q = 11. 

(i) What are the possible r values if r < 10? 

(ii) If r = 5 is chosen, use the system to encrypt the plaintext “13.” 

3. An RSA encryption system has pq = 143 and r — 43. Find p, q, and s. 

4. An RSA encryption system has pq = 143 and r — 23. Find p, q, and s. 

5. Use the RSA scheme with pq = 85 and s = 3 to decrypt the message “13.” 

In Exercises 6 to 14, assume you are using an RSA scheme with p = 13 and q = 23. 
Can the given number be chosen as a value for r? 


6. 6. 

7. 5. 

8. 9. 

9. 33. 

10. 31. 

11. 7. 

12. 2. 

13. 35. 

14. 55. 

Evaluate the expressions in Exercises 15 to 32. 


15. 2 45 (mod 11). 

16. 3 19 (mod 23). 

17. 2 21 (mod 44). 

18. 3 25 (mod 37). 

19. 4 12 (mod 19). 

20. 2 17 (mod 81). 

21. 2 25 (mod 7). 

22. 5 33 (mod 29). 

23. 12 50 (mod 35) 

24. 7 28 (mod 17). 

25. 2 81 (mod 13). 

26. 3 54 (mod 61). 

27. 2 53 (mod 11). 

28. 17 35 (mod 31). 

29. 4 37 (mod 51). 

30. 3 29 (mod 77). 

31. 9 23 (mod 79). 

32. 6 41 (mod 33). 


33. You are using an RSA system with r = 3 and pq = 55. 

(i) What is the decryption key for this system? 

(ii) You code A as 01, B as 02, and so on, and send each symbol separately. 
How would you encrypt these messages? 

(a) MAN DEAD, (c) EVIL, 

(b) FIFTEEN, (d) TOMORROW. 

34. Repeat the preceding problem using an RSA system with r = 5 and pq = 51. 

35. Using an RSA system with s = 13 and pq = 51 you receive the message 04 01 
05 01. Decrypt it (01 = A, etc.). 
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36. Using an RSA system with s = 5 and pq = 51, decrypt the message 04 20 01 
04 20 05 04. 

37. An encoded message 08 15 15 25 01 25 49 20 20 49 has been intercepted. It was 
sent to someone whose public key encoding function is given by 

m m 33 (mod 55). 

(i) Find s so that 

(mod 55), 

is the decryption function. 

(ii) Use this decryption function to find numbers that correspond to the message. 

(iii) It is assumed that the person who sent the encoded message used the corre¬ 
spondence 

ci i—> 01, b i—> 02, c i—> 03, ..., z i—^ 26, 

to convert letters to numbers and then used the above function to encode the 
numbers. Find the message. 

38. An RSA scheme uses modulus 91 and encryption rule m h-> m 29 . 

(i) What are the prime factors (p and q) of the modulus? 

(ii) What is the decryption rule? 

(iii) You receive the ciphertext 

72 88 88 72 13 21 72 88 33 51 51 33. 

If the encryptor is using the correspondence 

1 i-> 1, ..., 9t-^9, 0t-^10, A i-^ 11, ...h+36, 

to turn English letters and digits into numbers, what was the original plain¬ 
text? 

9.6 Other Cryptographic Ideas 

Signature Systems 

The public-key encryption schemes described above can be used to protect your mail 
so that others cannot read and understand it, but there is another problem: how can 
you be sure who really sent the message? 

In a written message, we usually expect the writer to sign the document. This 
is how banks verify checks and lawyers verify wills. But the traditional “pen-on- 
paper” signature does not apply to electronic information transfer. We need some 
sort of electronic “signature,” a means of authenticating a document. 
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Suppose you are using a system in which every member has made public an 
encryption rule and has a private decryption rule. For example, if the system is based 
on RSA, the public information consists of an exponent, a modulus, and instructions 
to raise that integer to the exponent and reduce modulo the modulus. (Of course, 
there are also some instructions on how to convert a message to an integer, but we 
shall assume this is done automatically.) For simplicity let us denote A’s encryption 
rule asm i-> e^(m) and her decryption rule by c i-> Ja(c)- 

We assume that A wishes to send message a to B. We assume that B will know 
the message comes from A —if this is not to be public knowledge, it can be included 
in the message a. A first sends the message esia). She then sends Ja(^), where b is 
some other message (B’s name, or the first part of a , or possibly—if added security 
is needed—the whole of a). B first calculates dsiesici)), recovering the original 
message. Then he calculates ^a(^aW) as a check on authenticity. (We assume he 
knows the message comes from A —if this is not to be public knowledge, it can be 
included in the message a.) 


Avoiding the Key Exchange 

Suppose two people wish to communicate using a cryptosystem, but they do not want 
to publish their encryption methods. This is still possible; we shall describe a method 
called the Massey-Omura scheme , also known as Shamir’s no-key algorithm. The 
two participants are Alice (A) and Bob ( B ). Each chooses an encryption scheme. 
We again denote A’s encryption rule as m i-> ^a(^) and her decryption rule by 
c i -> cIa(c). 

Say Alice wishes to send message a to Bob. She first sends the message ^a(^)- 
Bob does not know Alice’s secret key, so he cannot decipher the message. Instead, 
he reencrypts it with his own encryption method. So 

Alice sends e^ict) to Bob; 

Bob sends £ 5 (^( 0 )) to Alice. 

The message received by Alice is incomprehensible to anybody. Alice proceeds 
to decode it with her own secret key, then sends it back to Bob. Bob then decodes the 
message with his own key. So 

Alice sends ^(^(^aOz))) to Bob; 

Bob calculates ds (^(^(^aO^O)))- 

In general, eB(dA(es(eA(a)))) will be a string of nonsense. But suppose the two 
functions Ja and es commute. Then Ja^M) = £#(^a W) for any x, so 

d B (d A (eB(eA(a)))) - d B (e B (d A (e A (a)))) = d B (e B (a )) = a, 


and Bob recovers the message. 
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Here is an analogous situation that may clarify what is going on here. Suppose 
Alice wants to send a suitcase to Bob. She does not trust the employees of the courier 
service, so she locks the suitcase. Of course, Bob does not have the key. He puts a 
chain around the suitcase, attaches a padlock, and ships it back to Alice. She then 
unlocks the suitcase—she still cannot open it because of the chain—and sends it 
back to Bob who removes the chain. 

The only drawback to this system is that two functions Ja and e b will not neces¬ 
sarily commute. For example, the RSA scheme cannot be used (see Exercise 9.7.1). 
However, RSA is suitable if the two moduli are the same. If both Alice and Bob use 
modulus m, then JaM = x s (mod m) and esM = x r (mod m) for some integers 
r and s, and for any v 

d A (e B (x)) = d A (x r ) = (V)* = (x*) r = e B (x s ) = e B (d A (x)), 

so the functions commute. So Alice and Bob would need to have some communica¬ 
tion, but the information they exchange in public—the modulus—is not enough to 
allow others to send messages. 


Tossing a Coin 

We conclude with another example of the need to authenticate information. The prob¬ 
lem is as follows: how is it possible to simulate a random process with a 50% prob¬ 
ability of success or failure? If two people meet face to face they can flip a coin. But 
this is not possible if the two participants are corresponding by telephone or com¬ 
puter. Assuming that they do not trust each other, what is to stop A from tossing 
a heads, and then, when B calls heads, saying that the toss was a tails? Instead of 
authenticating the coin toss, we shall outline a fair procedure that has a 50% chance 
that A wins and a 50% chance that B wins. 

First A selects two large primes p and q and sends the product pq (but not the 
factorization) to B. To “win the toss,” B must factor pq. What he does is select an 
integer x, prime to p and q in the range l < x < pq and send x 2 to A. Now A 
selects a number y such that y 2 = x 2 (mod pq) (a “square root” of x 2 ) and sends it 
to B. 

If y 2 = x 2 , then 

pq\y 2 - X 1 , 
pq | O' - x){y +x), 

and there are four possibilities, according to whether p divides y — x or y + x and q 
divides y — x or y + v. So we get four possible cases 
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y = x 

(mod p), 

(9.4) 

y = x 

(mod p), 

(9.5) 

y = x 

(mod q), 

y = x 

(mod q ), 

y = —x 

(mod p), 

(9.6) 

y = —x 

(mod p), 

(9.7) 

y =x 

(mod q), 

y = —x 

(mod q). 


Each of these has a unique solution modulo pq : (9.4) has solution x = y, (9.7) 
has x = — y, while (9.5) and (9.6) relate v to y in other ways. It is easy to check that 
they are all different. A cannot tell which of the four possibilities equals x, so she 
chooses one at random. 

If she chooses the solution corresponding to the cases in (9.4) or (9.7), and sends 
it back to B , this does not help B to factor pq. However, say she chooses the solution 
to (9.6). Then 

P\y + x, q | y -x. 

Since y ^ x, p does not divide y — x. So (y — x, pq) = q. So B can factor pq; 
similarly he can factor pq if (9.5) is chosen. 

Notice that A must send a number N that has two prime factors. If N is itself 
a prime number (or a power of a prime), then B always loses. This problem can 
be handled by allowing B to test whether N is prime. Ordinarily, this would be as 
difficult as factoring N, and we have already pointed out that factoring takes a long 
time. But by using a probabilistic test for primality, B can very quickly (and with a 
very small probability of error) determine whether or not N is prime. 

If A sends a number with more than two prime factors, she decreases her chance 
of winning—with n factors there are 2 n roots, and the number of cases in which A 
wins is only 2. Similarly, B can simply deny that he has the ability to factor. So either 
player can cheat in the opponent's favor, but not for his/her own gain. 

Exercises 9.6 

1. A uses an RSA cryptosystem with modulus 33 and encryption exponent 7. B 

uses an RSA cryptosystem with modulus 35 and encryption exponent 5. 

(i) Show that A’s decryption exponent is 3. 

(ii) Find [2 5 (mod 35)] 3 (mod 33). 

(iii) Find [2 3 (mod 33)] 5 (mod 35). 

(iv) Show that the functions and e b , defined by 

d A (x) = x 3 (mod 33) and es = x 5 (mod 35), 
do not commute. 

2. Repeat the preceding exercise with the message 2 replaced by 5. 

3. Calculate [5 3 (mod7)] 4 (mod 11) and [5 4 (mod ll)] 3 (mod 7). 

4. Calculate [2 3 (mod5)] 5 (mod 7) and [2 5 (mod7)] 3 (mod 5). 
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9.7 Attacks on the RSA System 

If the RSA system is secure what is the point of attacking it? 

Even though the system itself seems impervious to attack, it is possible that it 
has been implemented in such a way that it can be broken. We would then say that 
the RSA system has been compromised. In this section we shall discuss some faulty 
implementations of RSA. 


Using the Chinese Remainder Theorem 

Consider a public key system using the RSA scheme that has been set up as suggested 
in the preceding section. Member i has two primes pi and qi, and an exponent r; 
which is relatively prime to (pi — l)(qi — 1). He releases to the public the information 
(mi, ri), where mi = piqi. He keeps secret the information (pi, qi, Si) where Si is 
the inverse of 77 modulo (pi — l)(qt — 1 ): 

Si = r ~ 1 (mod (p, - 1 )(q, - 1)). 

Suppose a user (such as a system operator or a military commander) needs to 
communicate the same message to most or all members of the system. For the con¬ 
venience of this user, it has been suggested that one should use a common value r 
for all the r/ and that it should be a small number. For example, one might set each 77 
equal to 3. This would, in fact, be cheaper and quicker to operate. However, we shall 
now show that this system would not be secure. 

Fet us suppose you wish to send the message A to users U\, U 2 , and I/ 3 . (We 
assume that A is an integer that can be translated back to a verbal message by some 
prearranged system.) You would proceed to calculate the three encrypted messages 
Ci, C 2 , and C 3 and transmit Q to £//. The Ci are remainders calculated as follows: 

A 3 = Ci (mod mi), 

A 3 = C 2 (mod m 2 ), (9.8) 

A 3 = C 3 (mod m 3 ). 

We can assume the moduli m\, m 2 , and m 3 are relatively prime because other¬ 

wise the system is compromised (see Sample Problem 9.19). If a codebreaker solves 
the equations (9.8) using the Chinese remainder theorem, she obtains the solution 
A 3 = X where X is the unique integer such that 

A = X (mod m \ mim^), 0 < X < m\m 2 m^. 

In order for the recipients to decode messages unambiguously, all messages must 
be positive integers smaller than the modulus. So A < min (mi, m 2 , m 3 ). This means 
A 3 < mim 2 m 3 , and, in fact, 
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A 3 = X, 

is an equality between integers. So A equals the cube root of X (the ordinary cube 
root, in ordinary arithmetic). There is a polynomial algorithm for extracting roots in 
ordinary integer arithmetic, so A can be calculated easily by the codebreaker. (There 
is no known polynomial algorithm for extracting roots in modular arithmetic.) 

Suppose we apply the same analysis to a system with common exponent r and n 
users. We obtain the equation 

A r = X (mod m\m 2 • • • m n ), 0 < X < m\m 2 • • • m n . 

To achieve the integer equation A r = X we need A r < mi m 2 • • -m n . This will 
certainly be true if n > r. So the problem can be avoided by choosing r greater than 
n. But this will not provide the suggested savings in a large system. 

Here is another way around this attack. Say there are 100 users of the system. 
Select a random arrangement jt of the integers {1,2,..., 100}. Instead of a message 
of (say) 993715, send to user i the message 99371500 + 7 r(/). When a user decrypts 
to A, he takes L^q J as the message. 

The following example illustrates this method. The moduli are far too small for 
a practical RSA system—they involve the six smallest primes with which 3 can be 
used as an exponent (remember that if p is one of the primes, then p — 1 and the 
exponent must be coprime). However, observe that some very large numbers arise. 

Sample Problem 9.18. In an RSA-based public key cryptosystem, three users 
have moduli m\ = 46, m 2 = 77, and m 3 = 85. A message is sent to all three 
using the common encryption exponent 3, and the coded messages received are 
C\ = 37, C 2 = 62, and C 3 = 68 , respectively. What was the original message? 

Solution. We have the following equations: 

A 3 = 37 (mod 46), 

A 3 =62 (mod 77), 

A 3 = 68 (mod 85). 

In the notation of the Chinese remainder theorem in Section 9.2, A\ = 6545, A 2 = 
3910, A 3 = 3542, A = 301070, <21 = 37, <22 = 63, a?, = 68 . The numbers 
x\,X 2 ,X 2 are the solutions to 

6545v = 1 (mod 46), 

3910jc = 1 (mod 77), 

3542v = 1 (mod 85), 

respectively. Using the Euclidean algorithm we find x\ = 39, X 2 = 9, V 3 = 3. 
Therefore 
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A 3 = 6545 x 39 x 37 + 3910 x 9 x 62 + 3542 x 3 x 68 
= 12348783 = 4913 (mod 301070). 

Therefore A 3 = 4913 and A = 17. 

Practice Exercise. In the same system, the coded messages received were 21, 
35, and 3, respectively. What was the original message? 

Sample Problem 9.19. Suppose a public key system, of the kind we are dis¬ 
cussing, has two moduli m\ and m 2 that are not coprime. Why is the system 
insecure ? 

Solution. We assume that each modulus is a product of two primes; say m\ = 
pqi and m 2 = pq2, where p is the common factor. All of the moduli are common 
knowledge, so a codebreaker can easily test all pairs of them for common factors, 
and will find p. She can then calculate q\ and q 2 . This enables her to quickly find 
the inverses of r\ modulo mi and ri modulo mi and decode the messages to users 
1 and 2. 


Belonging to an Exponent 

Suppose a and m are relatively prime integers. In the arithmetic modulo m, the se¬ 
quence of powers a 1 , a 2 , a 3 , ..., cannot all be different because the arithmetic has 
only a finite number of elements, so there will exist positive integers d and e for 
which a e = a d+e (mod m). Therefore 

m | a d+e — a e — a e (a d — l). 

Since m has no common factor with a , m must divide a d — so a d = 1 (modm) 
for some values of d. The smallest such positive integer d is called the exponent of a 
modulo m, and we say a belongs to the exponent d (mod m). For example, modulo 
7, 4 1 = 4, 4 2 = 2, 4 3 = 1, so 4 belongs to the exponent 3 (mod 7). 

Sample Problem 9.20. Find the exponent of 11 modulo 84. 

Solution. Calculating modulo 84, 


II 2 

= 121 

= 37 

ll 3 

= 407 

= 71 

ll 4 

= 781 

= 25 

ll 5 

= 275 

= 23 

ll 6 

= 253 

= 1, 


so 11 belongs to exponent 6 (mod 84). 
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Practice Exercise. Find the exponent of 5 modulo 48. 

The result of the following example will be used shortly. 

Sample Problem 9.21. Find the exponent of 49 modulo 214684. 

Solution. We see 

49 2 = 2401, 

49 3 = 117649, 

49 4 = 5764801 = 183017, 

49 5 = 8967833 = 165789, 

49 6 = 8123661 = 180353, 

49 7 = 8837297 = 35253, 

49 8 = 1727397 = 9925, 

49 9 = 486325 = 56957, 

49 10 = 2790893 = 1, 

so the exponent is 10 . 

Consider an RSA cryptosystem with p = 383 , q = 563, r = 49. The public 
information is m = 215629, r = 49. Using the factorization of m (i.e., the values of 
p and q) one finds that ( p — \){q — \) = 214684, g = 56957. So encryption A —>► C 
and decryption C A follow the rule 

C = A 49 (mod 215629), 

A = C 56957 (mod 215629). 

Suppose Ci = C 49 , C 2 = C 49 , and in general C (+ i = C 49 . Then C,- = C x 
where x = 49^ (mod 214684). We just saw that 49 belongs to the exponent 10 
modulo 214684, so 49 10 = 1 and C*0 = C, C 9 = A. 

For example, say A = 123456. We see 

C = (123,456) 49 = 1603, 

Ci = C 49 = 180661, 

C 2 = C/ 9 = 109265, 

C 3 = C 49 = 131172, 

C 4 = C 49 = 98178, 

C 5 = Cf = 56372, 

C 6 = Cf = 63846, 

C 7 = Cf = 146799, 

C 8 = cf = 85978, 

C 9 = Cf = 123456 = A, 

Cio = Cf = 1603 = C. 
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So you could try to break an RSA encryption by forming the sequence C 1 , C 2 , 

C 3 ,_If this sequence contains a repetition, say C * +1 = C, then C; will equal the 

original message A. 

The “solution” to this problem is to choose a value r that belongs to a large 
exponent modulo (p — l)(q — 1). However, there is no known polynomial algorithm 
for finding an exponent. 


Fermat Factorization 

Finally, some choices of two primes are such that their product is easy to factorize. 
This happens if the two primes are close in value. Fermat proposed the following 
method for factoring an odd positive integer n into two approximately equal factors. 

Suppose the factorization of n is n = ab , where a > b. Write a; = + b), the 

number midway between a and b , and y = \ (a — b). Then a = x+y, b = x — y, and 
n = (x + y)(jt — y). This is the desired factorization. Moreover, n = x 2 — y 2 , so y = 
y/x 2 — n. If we could guess x, then we could calculate y and find the factorization. 

Fermat factorization proceeds as follows. First find *Jn. If this is an integer, we 
have found the desired factorization (n = x 2 and y = 0). Otherwise, we set x = 
[y/n] and calculate y/x 2 — n. If t his is an integer, set y = V 'x 2 — n. Otherwise, 
increase i by 1 and test Vv 2 — n again. If we continue in this way, eventually a 
factorization will be found. Even if n is a prime, the method will eventually give the 
factorization n = n x 1. 

This factorization algorithm is not polynomial, and will take a very long time if 
the two factors are not close. However, it is often very fast. In the following example, 
the algorithm concludes after three steps, although one factor is 20 % larger than the 
other. 

Sample Problem 9.22. Use Fermat factorization to factor n = 430681. 

Solution. The square root of n is 656.2_This is not an integer, so we start 

testing from x = 657. 

x = 657, a 2 — n = 968, y/x 2 — n = 31.1..., 

x = 658, x 2 — n — 2283, y/x 2 — n = 47.7 ..., 

x = 659, x 2 — n = 3600, y/x 2 — n = 60. 

So we take y = 60. The factorization is n = (x — y)(x + y), or 

430681 = 599 x 719. 


Practice Exercise. Use Fermat factorization to factor 449329. 
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In view of the Fermat factorization problem, it is recommended that the two 
primes chosen for RSA should be relatively different in size. If a 230-digit modulus 
is used, primes of about 100 digits and 130 digits, respectively, have been recom¬ 
mended. 


Exercises 9.7 

In an RSA-based public key cryptosystem , three users have moduli m\ = 46, 
m 2 = 77, and m 3 = 85. A message is sent to all three using the common encryption 
exponent 3. The coded messages received are shown in Exercises 1 to 6. In each case 
find the original message. 

1 . Ci = 2 , C 2 = 15, C 3 = 16. 2. Ci = 5, C 2 = 6 , C 3 = 59. 

3. Ci = 17, C 2 = 64, C 3 = 60. 4. Ci = 26, C 2 = 34, C 3 = 28. 

5. Ci = 43, C 2 = 22, C 3 = 56. 6 . Ci = 6 , C 2 = 50, C 3 = 2. 

In Exercises 1 to 10, find the exponent of the given integer for the given modulus. 

7. 37 (mod 66 ). 8. 13 (mod 66 ). 

9. 13 (mod 84). 10. 5 (mod 84). 


In Exercises 11 to 16, use the “ exponent ” method to decrypt the coded message. 

11. Exponent 3, modulus 55, coded message 48. 

12. Exponent 3, modulus 55, coded message 18. 

13. Exponent 7, modulus 143, coded message 63. 

14. Exponent 7, modulus 143, coded message 42. 

15. Exponent 5, modulus 117, coded message 92. 

16. Exponent 17, modulus 365, coded message 37. 

In Exercises 17 to 26, use Fermat factorization to factor the given integer. 


17. 396611. 
19. 282943. 
21. 301033. 
23. 294677. 


18. 436021. 
20. 321389. 
22. 70303. 
24. 788131. 


25 . 862091. 


26 . 499561. 


10 


The Theory of Voting 


Voting and elections are part of everyday life. We vote for our representatives in 
government, and those representatives vote on legislation. We vote for our favorites 
in television talent quests; committees vote to decide who to hire. Sports leagues 
have a structure like that of elections (instead of a vote deciding that one team is 
better than another, we use the result of a match, but the structure is very similar). 

When we look more closely at electoral methods, we see that the mathematical 
structure is very important. We shall see that two different systems, each of which 
sounds perfectly fair, can give different results. Sometimes groups of voters can ma¬ 
nipulate the system to favor their candidates. 

Not every election will have a result; it is always possible that the voters like two 
candidates equally well, and there is a tie. But we shall also see some methods that 
sound very reasonable, but do not result in the election of a candidate even when 
there are no ties. 


10.1 Simple Elections 

Simple Elections 

The simplest form of election is where there are a number of candidates for office, 
and one is to be elected. There is a well defined set of voters—the electorate —and 
each voter casts one vote. 

Suppose two people are running for an office. After each person makes his or 
her one vote, the person who gets more than half the votes wins. This is called the 
majority or absolute majority method. Ties are possible—ties are possible in any 
electoral system—but apart from this the absolute majority method always produces 
a result. 


W.D. Wallis, A Beginner's Guide to Discrete Mathematics , 
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If there are three or more candidates, the majority method is not so good; there 
may quite easily be no winner. Several schemes have been devised that allow a can¬ 
didate with an absolute majority to be elected, and try to find a good approximation 
when there is no “absolute” winner. These are called majoritarian or plurality sys¬ 
tems. 

The first generalization is the plurality or simple majority method; it is often 
called “first-past-the-post” voting. Each voter makes one vote, and the person who 
receives the most votes wins. For example, if there were three candidates, A, B , and 
C and 70 voters, the absolute majority method requires 36 votes for a winner. If 
A received 30 votes and B and C each got 20, there will be no winner under that 
method. Under plurality, A would be elected. 

The problem with the plurality method is that the winner might be very unpopular 
with a majority of voters. In our example, suppose all the supporters of B and C 
thought that both these candidates were better qualified than A. Then the plurality 
method results in the election of the candidate that the majority thought was the 
worst possible choice. This problem is magnified if there are more candidates; even 
if there are only four or five candidates, people often think the plurality method elects 
the wrong person. 


Sequential Voting 

One technique used to avoid the problems of the plurality method is sequential vot¬ 
ing. In this scheme a vote is taken. As a consequence a new set of candidates is 
selected; then a new vote is taken. The aim is to reduce the set of candidates to a 
manageable size—often to size two. The original election is called a primary. 

For example, when the city of Carbondale, Illinois votes for Mayor, there is a 
primary election for all the Mayoral candidates. This is run like an ordinary election. 
Eater there is another election; the candidates in the final election are the two can¬ 
didates who received the most votes in the primary. This second (runoff) election is 
decided by the majority method. A similar method is used in electing the presidents 
of France and the Ukraine and in a number of other situations. 

We shall refer to this as the runoff method or plurality runoff method. The two top 
candidates are decided by plurality vote; all other candidates are eliminated. Then a 
majority vote is taken. 

In the real world there is usually a delay after the primary and more campaigning 
takes place. For simplicity’s sake we shall ignore this. We assume that every voter 
has an order of preference between the candidates that remains fixed throughout the 
voting process. We define the preference profile of an election to be the set of all the 
voters’ preference lists. This can conveniently be written in a table. For example, say 
there are three candidates, A, B, C and suppose 
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5 voters like A best, then B , then C; 
7 voters like B best, then A, then C; 
4 voters like A best, then C, then B ; 
3 voters like C best, then B, then A; 
no voters like B, then C, then A; 
no voters like C, then B , then A. 

We can represent this as 


5 7 4 3 
ABAC 
B A C B 
C C B A 


(We can also write 


5 

1 

4 

3 

0 

0 

A 

B 

A 

C 

B 

C 

B 

A 

C 

B 

C 

A 

C 

C 

B 

A 

A 

B 


but we shall usually omit zero columns.) 

In a simple election, each voter votes for the candidate he or she likes best. In 
the example, A would receive nine votes—five from those with preference list ABC , 
and four from those with list AC B. In general, a candidate receives the votes of those 
who put that candidate first in the preference list. 

Sample Problem 10.1. Suppose the preference profile of an election is 


5 

1 

4 

3 

A 

B 

A 

C 

B 

A 

C 

B 

C 

C 

B 

A 


What is the result of the election in the following cases? 

(i) The majority method is used. 

(ii) The plurality method is used. 

(iii) The runoff method is used. 

Solution. A receives nine votes, B receives seven, C receives three. So (i) there 
is no majority winner (as there are 19 voters, 10 votes would be needed), and 
(ii) A is the plurality winner. In a primary election, A and B are selected to 
contest the runoff. For the runoff, C is deleted, so the preference profile is 
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5 

7 

4 

3 

A 

B 

A 

B 

B 

A 

B 

A 


or (combining columns with the same preference list) 


9 

10 

A 

B 

B 

A 


So B wins the runoff. 

Practice Exercise. Repeat this question for an election with preference profile 


1 

5 

8 

3 

4 

A 

A 

B 

B 

C 

B 

C 

A 

C 

B 

C 

B 

C 

A 

A 


The Hare Method 

The Hare method or alternative vote system was invented by the English lawyer Sir 
Thomas Hare in 1859. It is most useful in its more general form, for situations where 
several representatives are to be elected at once (see Section 10.2). In this section we 
look at the simpler version. 

The Hare method requires each voter to provide a preference list at the election. 
This list is called a ballot. The candidate with the fewest first place votes is elimi¬ 
nated. Then the votes are tabulated again as if there were one fewer candidate, and 
again the one with the fewest first place votes in this new election is eliminated. 
When only two remain, the winner is decided by a majority vote. 

Sample Problem 10.2. Suppose there are four candidates for a position, and 24 
voters whose preference profile is as follows: 


5 

7 

4 

3 

3 

2 

A 

B 

A 

C 

D 

D 

C 

D 

D 

D 

C 

C 

D 

A 

C 

B 

A 

B 

B 

C 

B 

A 

B 

A 


Who would win using the following electoral systems? 

(i) The plurality method. 

(ii) The runoff method. 
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(iii) The Hare method. 

Solution, (i) The votes for A, B , C, and D are 9, 7, 3, and 5, respectively, so A 
would win under plurality voting. 

(ii) Under the runoff method there is a tie. A and B are retained and the new 
preference profile is as follows: 


5 

7 

4 

3 

3 

2 

A 

B 

A 

B 

A 

B 

B 

A 

B 

A 

B 

A 


giving 12 votes to each candidate. 

(iii) In the Hare method we first eliminate C, obtaining 


5 7 4 3 3 2 

A B A D D D 

B D D B A B 

D A B A B A 

Now we eliminate B : 


5 7 4 3 3 2 

A D A D D D 

D A D A A A 

So D wins 15-9. 

Practice Exercise. Repeat the above question for the initial preference profile 


6 

1 

1 

1 

2 

1 

5 

2 

A 

B 

A 

C 

D 

D 

B 

D 

C 

D 

D 

D 

C 

C 

D 

B 

D 

A 

C 

A 

A 

B 

C 

C 

B 

C 

B 

B 

B 

A 

A 

A 


The Condorcet Method 

A multiple use of runoff elections was discussed by Marie Jean Antoine Nicolas 
Caritat, Marquis de Condorcet, an eighteenth Century French mathematician and 
political theorist. (Similar ideas were proposed by Ramon Llull as long ago as 1299.) 

Suppose we simultaneously conduct all the “runoff” elections among our candi¬ 
dates. For example, in the election discussed in Sample Problem 10.2, there are six 
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runoffs: A versus B , A versus C, A versus D , B versus C, B versus D , and C ver¬ 
sus D. If any one candidate wins all his/her runoffs, then surely you would consider 
that person a winner. Such a candidate is a Condorcet winner. 

In Sample Problem 10.1, we find: 

B beats A 10-9, 

A beats C 16-3, 

B beats C 12-7, 

so B is a Condorcet winner. Similarly, in Sample Problem 10.2, D is a Condorcet 
winner. But even in the simple example 


3 

2 

2 

A 

B 

C 

B 

C 

A 

C 

A 

B 


A beats B 5-2, B beats C 5-2, and C beats A 4-3, so there is no Condorcet winner. 

In elections with several candidates, it is very common to have no Condorcet 
winner, even when there are no ties. This is a serious fault in the Condorcet method. 

Sample Problem 10.3. Consider the election with preference profile 


5 

4 

3 

A 

B 

C 

B 

C 

A 

C 

A 

B 


Who would win under the Hare method? Is there a Condorcet winner? 

Solution. The votes for A, B , C, and D are 5, 4, and 3, respectively. Under the 
Hare method C is eliminated. The new preference profile is 


5 

4 

3 

A 

B 

A 

B 

A 

B 


So A wins 8-4. Looking at all three runoffs, we see that A beats B 8-4, B beats 
C 9-3, and C beats A 7-5, so there is no Condorcet winner. 


Point Methods 

Pointscore methods have often been used in sporting contests. For example, they are 
commonly used in track meets and in motor racing. When the Olympic Games are 
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being held, many newspapers publish informal medal tallies to rank the performance 
of the competing nations—the usual method is to allocate three points for a gold 
medal, two for a silver, one for a bronze, and then add. 

In general, a fixed number of points are given for first, second, and so on. The 
points are totalled and the candidate with the most points wins. If there are n com¬ 
petitors, a common scheme is to allocate n points to first, n — 1 to second, ..., or 

equivalently n — 1 to first, n — 2 to second,_This case, where the points go in 

uniform steps, is called a Borda count. 

One often see scales like 5, 3, 2, 1, where the winner gets a bonus, or 3, 2, 1, 
0, 0, ... (that is, all below a certain point are equal). Sometimes more complicated 
schemes are used; for example, in the Indy Racing League, the following system has 
been used: 


1st gets 
2nd gets 
3rd gets 
4th gets 


20 5th gets 10 

16 6th gets 8 

14 7th gets 6 

12 8th gets 5 

Fastest qualifier gets one point. 
Leader of most laps gets one point. 


9th gets 4 
10th gets 3 
11th gets 2 
12th gets 1 


Pointscore methods are occasionally employed for elections, most often for small 
examples such as the selection of the best applicant for a job. 

Sometimes the result depends on the point scheme chosen. 


Sample Problem 10.4. What is the result of an election with preference table 


5 

7 

4 

3 

A 

B 

A 

C 

C 

C 

C 

B 

B 

A 

B 

A 


if a 3, 2, 1 count is used? What is the result if a 4, 2, 1 count is used? 

Solution. With a 3, 2, 1 count the totals are A : 37, B : 36, C : 41, so C wins. 
With a 4, 2, 1 count the totals are A : 46, B : 43, C : 44, and A wins. 

Exercises 10.1 

1. Eighty-six electors vote between candidates A, B, and C. Their votes are 25 
for A, 48 for B , and 13 for C. What is the result under the majority method? 
What is the result under the plurality method? 

2. Twenty-eight electors vote between candidates A, B, and C. Their votes are 5 
for A, 14 for B, and 9 for C. What is the result under the majority method? What 
is the result under the plurality method? 
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3. Seventy-five electors vote between candidates A, B, and C. Twenty-five vote for 
A and 28 vote for B. How many votes did C receive? What is the result under 
the majority method? What is the result under the plurality method? 

4. Fifty-four electors vote between candidates A, B, and C. A receives 24 votes 
while B and C received equal numbers. How many did they each receive? What 
is the result under the majority method? What is the result under the plurality 
method? 

5. Suppose 73 voters must decide between candidates A, B, C. There are 14 votes 
for A. How many votes must B receive to win if the majority method is used? 
How many will suffice under the plurality method? 

6. Suppose 50 voters must decide between candidates X , Y, Z. There are 20 votes 
for X. How many votes must Y receive to win if the majority method is used? 
How many will suffice under the plurality method? 

7. First-year students in a university vote for Class president. There are three can¬ 
didates, Smith, Jones, and Brown. The preference table is 


25 

27 

14 

22 

24 

11 


S 

J 

J 

B 

B 

J 

B 

S 

B 

S 

J 

B 

J 

B 

S 

J 

S 


(i) How many students voted? 

(ii) How many first place votes did each candidate receive? 

(iii) Who, if anybody, would win under the plurality method? 

(iv) Who, if anybody, would win under the majority method? 

8. At the Academy Awards there are three nominees for Best Actor: Arthur An¬ 
drews, Bob Brown, and Clive Carter. The preference table is 


123 

101 

124 

308 

222 

305 

A 

A 

B 

B 

C 

C 

B 

C 

A 

C 

A 

B 

C 

B 

C 

A 

B 

A 


(i) How many actors voted? 

(ii) How many first place votes did each candidate receive? 

(iii) Who, if anybody, would win under the plurality method? 

(iv) Who, if anybody, would win under the majority method? 

9. In the election described in Exercise 10.1.7, who would win under the runoff 
method? 
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10. In the election described in Exercise 10.1.8, who would win under the runoff 
method? 

11. Twenty-three electors vote for three candidates, resulting in the preference table 


8 

6 

5 

4 

Z 

Y 

Z 

Y 

Z 

Z 

Z 

X 

Y 

X 

Y 

Z 


(i) Who, if anybody, would win under the majority method? 

(ii) Who, if anybody, would win under the plurality method? 

(iii) What would be the result if a Borda count were used? 

(iv) Would any of these results change if the five voters who preferred Z were to 
exchange their second and third preferences? 

12. In the election of Exercise 10.1.11, it is decided to use a scoring system where 
first place gets n points, second gets 2, and third gets 1 where n is some whole 
number greater than 2. For what ranges of values is X winner? What is the range 
for y? For Z? 

13. In how many ways can a voter rank three candidates if no ties are allowed? In 
how many ways can it be done if ties are allowed? 

14. In how many ways can a voter rank four candidates if no ties are allowed? In 
how many ways can it be done if ties are allowed? 

15. Suppose there are three candidates in an election. Is there any difference whether 
the runoff method or the Hare method is used? 

16. Verify that, in the election described in Sample Problem 10.2, D is a Condorcet 
winner. 

17. Fifteen students vote on three restaurants for a group dinner. Their choices are 
Chinese, Italian, and Thai. If their preference table is 


6 

5 

4 

C 

I 

T 

I 

T 

I 

I 

C 

C 


(i) What will be their decision under the plurality method? 

(ii) What will be their decision using a Borda count? 

(iii) Is there a Condorcet winner? 

18. Nineteen sports announcers rank the final four in a basketball tournament. The 
teams are Creighton (C), Drake (D), Evansville ( E ), and Southern Illinois ( S ). 
Their preference table is 
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9 

3 

2 

5 

s 

C 

E 

C 

c 


D 

E 

E 

D 

C 

D 

D 

E 

S 

S 


Which team will they choose as the favorite if they use the following methods? 

(i) Plurality. 

(ii) A Borda count. 

(iii) The Hare method. 

19. There are four candidates for club president; we denote them A, B, C, D. The 
17 members rank them as follows: 


1 

6 

4 

c 

D 

A 

B 

C 

B 

A 

B 

D 

D 

A 

C 


The club by-laws specify that the Hare system should be used. 

(i) Who is elected? 

(ii) Just before the election, candidate C withdraws from the race. Who is 
elected in that case? 

20. Twenty-one electors wish to choose between four candidates A, B, C, D. The 
preference table is 


8 

9 

4 

C 

B 

A 

B 

C 

B 

A 

D 

C 

D 

A 

D 


(i) Who is elected under the plurality method? 

(ii) Who is elected under the Hare method? 

21. Suppose the electors in Exercise 10.1.20 decide to use a pointcount method. 

(i) Who would win under the (4, 3,2, 1) Borda count? 

(ii) Who would win if a (5, 3, 1,0) count is used? 

22. A 13-member committee must select from candidates A, B, C, D. Their prefer¬ 


ences are 
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6 

3 

4 

A 

C 

C 

B 

B 

D 

D 

A 

B 

C 

D 

A 


(i) Who is elected under the plurality method? 

(ii) Who is elected under the Hare method? 

(iii) Who would win under a Borda count? 

23. Given the following preference table, who would win under plurality voting? 
Who would win in a runoff? 


6 

3 

4 

2 

1 

A 

C 

C 

B 

E 

E 

B 

D 

A 

A 

B 

E 

A 

C 

B 

C 

D 

E 

D 

C 

D 

A 

B 

E 

D 


Is there a Condorcet winner? 

In Exercises 24 to 26, a preference table is given. Who would win in a Borda count? 
Is there a Condorcet winner? 


24. 


25. 


6 

4 

4 

A 

B 

C 

B 

C 

B 

C 

A 

A 


3 

4 

2 

5 

4 

A 

A 

B 

B 

C 

B 

C 

A 

C 

B 

C 

B 

C 

A 

A 


2 

3 

4 

2 

D 

C 

B 

A 

C 

A 

D 

B 

A 

D 

C 

C 

B 

B 

A 

D 


27. Construct a preference profile for four voters and four candidates such that three 
voters prefer W to X, three prefer X to Y, three prefer Y to Z, and three prefer 
Z to W. 
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10.2 Multiple Elections 

The Generalized Hare Method 

In this section we discuss cases where several candidates are to be elected simulta¬ 
neously. 

The Hare system was modified by Andrew Inglis Clark who was Attorney Gen¬ 
eral of Tasmania in the late 19th century. This generalized Hare method , or quota 
method , is used in many countries, including Ireland, Australia, and Malta. 

As an example, suppose 24,000 people are to elect five representatives from a 
larger number of candidates. 

Each person makes a preferential vote, listing all the candidates in order. (In 
other words, they record all of their preference profiles, not just the first members.) 
The first name on the list is called the voter’s first preference , and so on. 

The quota is 4000 votes. (This number is chosen because you cannot have six 
candidates who each get more than 4000 first preferences.) Every candidate who 
gets more than 4000 votes is elected. If a candidate gets more than 4000 votes, the 
remainder of those votes go to voters’ second choices, divided proportionally. To 
illustrate this, suppose A gets 5000 votes. Of these, 2500 have B as second choice, 
2000 have C and 500 have D. Since this is greater than 4000, A is declared elected. 
The 1000 surplus votes are divided in the proportion 2500 : 2000 : 500, or 50% : 
40% : 10%. That is, B gets 50% of A’s preferences because 2500 is 50% of 5000; 
C gets 40%; and D gets 10%. As we said, A has exceeded the quota by 1000 votes, 
so we say he has 1000 preferences to be distributed. B gets 50% of these; 500 more 
votes are added to B’s total. In the same way C gets 400 added votes (40%) and D 
gets 100 votes (10%). 

Now we check whether B, C, or D has exceeded the quota. For example, if B 
previously had 3700 votes, the new total would be 4200, exceeding the quota again, 
so B is declared elected. 

If not enough candidates have been elected, and no one has enough votes, the 
remaining candidate with the fewest votes is eliminated. All of that candidate’s pref¬ 
erences are distributed (in just the same way as in the regular Hare method: no per¬ 
centages need to be calculated). 

In general, suppose there are V voters and N places to be filled. The quota is 

V 

N+l' 

and a candidate is declared to be elected if his or her number of votes exceeds the 
quota. Notice that, if only one candidate were to be elected, the quota requirement 
is that a candidate receive a majority of the votes. (Some systems use the formula 
^q-j- + 1 and say a candidate is elected if the quota is equalled or exceeded. This 
may require one more vote in the case where -^y is not an integer.) 
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In a real example, the whole list of preferences is kept. The process may require 
a great deal of data; moreover it may result in complicated numbers, fractions, and 
so on. Historically this was a serious problem and caused long delays in announcing 
the results of elections, but it is no longer an issue now that computers are available 
and voting machines can be adapted to keep all the data. 

For example, given a preference profile of the form 


12 

9 

6 



A 

A 

A 

B 


B 

B 

C 



C 

D 

B 



D 

C 

D 




and a quota of 18, A’s surplus is nine. The surplus is divided 7 : 2 between B and C, 
but if we divide it 4 : 3 : 2 between the first three columns later calculations will be 
easier. 

Sample Problem 10.5. Say there are five candidates for three positions; the pref¬ 
erence table is 


6 

6 

9 

6 

3 

2 

A 

A 

C 

c 

E 

E 

B 

B 

D 

D 

C 

A 

E 

D 

E 

E 

D 

B 

D 

E 

A 

B 

A 

C 

C 

C 

B 

A 

B 

D 


Who will be elected? 

Solution. There are 32 voters so the quota is 8. A gets 12 first place votes and C 
gets 15, both of which exceed the quota. So A and C are elected. 

A has a surplus of 4. The second-place candidate in every case in B , but we 
observe that the votes are divided in proportion 6:6 between A’s two lists, so we 
give two votes to each of A’s lists. C is also at the top of two lists, and has a 
surplus of 7, divided 9 : 6. This gives surplus allocations of 4.2 and 2.8. After 
these allocations, the new table is 


2 

2 

4.2 

2.8 

3 

2 

B 

B 

D 

D 

E 

E 

E 

D 

E 

E 

D 

B 

D 

E 

B 

B 

B 

D 


B has four votes, D has 7, and E has 5. No one meets the quota. (Note that the 
quota does not change.) So B (who had the fewest votes) is eliminated. We now 
have 
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2 

2 

4.2 

2.8 

3 

2 

E 

D 

D 

D 

E 

E 

D 

E 

E 

E 

D 

D 


D now has 9 votes and E has 7, so D is elected. In total, A, C, and D are elected. 
Practice Exercise. Repeat the above problem for preference profile 


5 

5 

7 

8 

5 

2 

A 

A 

B 

C 

D 

E 

B 

E 

D 

D 

C 

A 

E 

B 

E 

E 

E 

B 

D 

D 

A 

B 

A 

C 

C 

C 

C 

A 

B 

D 


Approval Voting 

Approval voting was first used in cases where the number of people to be elected 
was not fixed. For example, if a committee wants to co-opt a few more members 
to help organize a function, they might add all those who they think would make a 
positive contribution, not a fixed number. In that case a member might vote for every 
suitable candidate, casting as many votes as she wishes; all the votes are considered 
equal. The number of votes received by a candidate is called his approval rating. 
Candidates with an approval rating of at least 50% (or 60%, or some other agreed 
figure) are elected. 

Sometimes there is a strict requirement that a certain number be elected, or a min¬ 
imum or maximum is imposed. Ties are possible, and some sort of runoff procedure 
may be necessary. The following example illustrates these ideas. 

Sample Problem 10.6. Ten board members vote on eight candidates by ap¬ 
proval. The candidates are A, B, C, D, E, E, G, H, and the board members 
are q, r, s, t, u,v, w, x, y, z- They vote as follows (x represents approval): 



q 

r 

s 

t 

u 

V 

w 

V 

y 

z 

A 

X 

X 

X 


X 


X 

X 


X 

B 


X 

X 

X 

X 

X 

X 

X 

X 


C 



X 





X 



D 

X 


X 

X 

X 

X 

X 

X 


X 

E 

X 

X 

X 

X 

X 



X 



F 

X 

X 

X 

X 

X 

X 

X 


X 

X 

G 

X 


X 

X 



X 


X 


H 


X 


X 

X 

X 




X 


(i) Which candidate is chosen if just one is to be elected? 

(ii) Which candidate is chosen if the top four are to be elected? 
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(iii) Which candidate is chosen if the top two are to be elected? 

(iv) Which candidate is chosen if at most four are to be elected and 80% approval 
is required? 

Solution. Here is a summary of the votes received: 

A-7, B- 8, C — 2, D- 8, E- 7, F — 9, G - 5, H —1. 

(i) F , who received the most votes. 

(ii) A, B , D , Z 7 . 

(iii) Z 7 is elected; there will need to be a runoff election between B and D. 

(iv) B, D, and Z 7 are elected. 

The array in the example is called an approval table ; we shall always denote 
approval with a cross in such tables. 

Approval voting is particularly useful for situations like the selection of new em¬ 
ployees. In those circumstances there is usually a minimum requirement, and further 
applications may be called if not enough good applicants are available. 


The Single Transferable Vote 

The single transferable vote (or STV) system is a variation of the generalized Hare 
method that contains some aspects of approval voting. Every elector decides whether 
or not a candidate is approved, and supplies a preference profile of the approved can¬ 
didates. In the preference table, some columns will have blank spaces at the bottom. 

There are various ways in which an STV system can be implemented. Typically, 
any vote that lists no approved candidates is eliminated before the quota is calculated. 
Then elections proceed similarly to elections under the generalized Hare method. 
However, it is possible for a vote to disappear because all the candidates named on 
it have been eliminated. In the basic system this can mean that no candidate reaches 
the quota, even though not all vacancies have been filled; the standard procedure is 
to hold another election for the remaining seats. However, another possibility is to 
recalculate the quotas at each stage. We shall refer to this as dynamic STV. 

Sample Problem 10.7. An election for two positions is held under the single 
transferable vote system; the preference table is 


5 

5 

5 

5 

3 

1 

A 

A 

A 

A 

B 

C 

B 

C 



C 

B 

C 

B 



A 

A 


What is the result under the basic STV system? What is the result under dynamic 
STV? 
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Solution. There are 24 electors, so the quota is 8. A is elected and has 12 surplus 
votes. The new preference table is 


3 

3 3 

3 3 

1 

6 

4 6 

B 

C 

B 

C 

or equivalently B 

C 

C 

B 

C 

B 

C 

B 


B has six votes and C has four; neither is elected and a new election is held for 
the remaining position. 

Under the dynamic system, the new preference table is treated as if it were the 
preference table for a new election; the “empty” ballots are discarded, and we 
proceed as though there were 10 votes with preference table 


6 

4 

B 

C 

C 

B 


B is elected. 


Exercises 10.2 


In Exercises 1 to 4, a preference table is shown for an election where two candidates 
are to be elected using the generalized Hare method. In each case, what is the quota ? 
What is the outcome of the election? 


1 . 


2 . 


3. 


7 

8 

7 

5 

A 

B 

C 

D 

B 

D 

B 

C 

C 

C 

A 

B 

D 

A 

D 

A 


3 

5 

8 

8 

6 

A 

A 

B 

C 

D 

B 

C 

D 

B 

C 

C 

D 

C 

A 

B 

D 

B 

A 

D 

A 


4 

6 

6 

6 

2 

6 

A 

D 

B 

E 

C 

B 

D 

A 

D 

A 

E 

A 

E 

E 

C 

D 

D 

E 

B 

B 

E 

C 

A 

C 

C 

C 

A 

B 

B 

D 
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6 

7 

7 

1 

1 

5 

A 

B 

C 

D 

C 

E 

E 

D 

B 

E 

D 

B 

D 

A 

D 

A 

E 

A 

B 

E 

E 

C 

A 

C 

C 

C 

A 

B 

B 

D 


In Exercises 5 to 10, a preference table is shown for an election where three candi¬ 
dates are to be elected using the generalized Hare method. In each case, what is the 
quota? What is the outcome of the election? 


4 

1 

9 

8 

1 

5 

A 

A 

B 

C 

D 

D 

B 

B 

D 

D 

C 

A 

C 

D 

C 

A 

B 

B 

D 

C 

A 

B 

A 

C 


24 

16 

19 

21 

10 

10 

A 

B 

c 

D 

E 

E 

B 

A 

D 

C 

C 

A 

E 

D 

E 

E 

D 

B 

D 

E 

A 

B 

B 

D 

C 

C 

B 

A 

A 

C 


6 

6 

6 

6 

5 

3 

A 

A 

c 

c 

E 

E 

B 

D 

D 

E 

C 

A 

E 

E 

E 

D 

D 

B 

D 

B 

A 

B 

A 

C 

C 

C 

B 

A 

B 

D 


9 

8 

7 

6 

4 

2 

A 

B 

D 

D 

C 

A 

E 

D 

E 

E 

D 

B 

D 

A 

C 

A 

E 

E 

B 

E 

B 

C 

A 

C 

C 

C 

A 

B 

B 

D 


6 

6 

7 

5 

6 

5 

5 

A 

B 

B 

C 

D 

E 

E 

B 

E 

D 

E 

E 

B 

B 

D 

A 

A 

B 

A 

C 

A 

C 

C 

E 

A 

B 

D 

D 

E 

D 

C 

D 

C 

A 

C 
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10 . 


8 

6 

7 

1 

9 

3 

8 

A 

A 

B 

C 

c 

D 

E 

D 

B 

C 

D 

E 

C 

A 

E 

D 

E 

E 

D 

B 

B 

B 

E 

A 

A 

A 

E 

D 

C 

C 

D 

B 

B 

A 

C 


11. Eight administrators s, t, u, v, w, x, y, z are voting to fill one position in senior 
management from candidates A, B , C, D , E , F. They use an approval system; 
their approval table is 



s 

t 

u 

V 

w 

X 

y 

z 

A 

X 

X 




X 



B 

X 


X 

X 

X 



X 

C 


X 


X 



X 

X 

D 


X 


X 

X 


X 


E 



X 



X 

X 


F 


X 

X 

X 



X 

X 


(i) What is the outcome? 

(ii) What is the outcome if at least 80% approval rating is required? 

(iii) What is the outcome if at least 60% approval rating is required? 

(iv) The administrators are told that they can appoint more than one person if 
they wish. What is the outcome if at least 60% approval rating is required? 

In Exercises 12 to 17, an approval table is shown for selection of at most three 
candidates. In each case 

(i) What is the outcome if there is no minimum requirement? 

(ii) How many votes are needed if at least 66% approval is required? 

(iii) What is the outcome if at least 66% approval is required? 



r 


t 

u 

V 

w 

V 

y 

z 

A 

X 

X 




X 




B 

X 


X 

X 

X 

X 



X 

c 


X 


X 



X 

X 

X 

D 


X 


X 

X 

X 


X 


E 



X 

X 



X 

X 


F 

X 


X 

X 

X 



X 

X 

G 



X 

X 



X 

X 


H 

X 


X 

X 

X 



X 

X 


12 . 
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13. 



P 

q 

r 


t 

u 

V 

w 

X 

y 

z 

A 

X 

X 


X 

X 

X 


X 




B 

X 

X 

X 


X 

X 

X 

X 



X 

C 


X 


X 


X 



X 

X 

X 

D 


X 


X 

X 

X 


X 


X 


E 

X 

X 

X 

X 

X 

X 

X 


X 

X 

X 

F 



X 

X 




X 

X 

X 


G 



X 


X 

X 



X 

X 


H 

X 


X 


X 

X 

X 



X 

X 


14. 



P 

q 

r 


t 

u 

V 

w 

X 

y 

z 

A 


X 

X 

X 

X 

X 

X 

X 


X 


B 


X 

X 


X 

X 


X 



X 

c 









X 

X 

X 

D 


X 


X 

X 

X 


X 


X 


E 

X 


X 

X 





X 


X 

F 




X 




X 

X 



G 





X 

X 



X 



H 

X 

X 

X 

X 

X 

X 

X 



X 

X 



q 

r 


t 

u 


V 

w 

X 

y 

z 

A 

X 


X 


X 



X 

X 



B 

X 



X 

X 


X 

X 



X 

C 


X 

X 


X 


X 


X 

X 

X 

D 


X 


X 

X 


X 

X 


X 


E 

X 

X 

X 

X 

X 


X 


X 

X 

X 

F 



X 

X 




X 

X 

X 


G 



X 


X 


X 


X 

X 


H 

X 


X 





X 


X 

X 

I 



X 


X 


X 



X 


J 

X 



X 

X 


X 

X 



X 


16. 



q 

r 

5 

t 

u 

V 

w 

X 

y 

z 

A 

X 



X 

X 

X 

X 



X 

B 


X 


X 

X 


X 

X 

X 


c 



X 


X 

X 

X 


X 


D 

X 

X 

X 


X 


X 

X 



E 

X 


X 

X 

X 

X 


X 


X 

F 



X 




X 


X 

X 

G 



X 

X 

X 



X 

X 


H 

X 


X 


X 

X 



X 


I 

X 

X 

X 


X 

X 


X 


X 

J 




X 

X 

X 

X 



X 
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17. 



o 

p 

q 

r 


t 

u 

V 

w 

V 

y 

z 

A 


X 

X 



X 

X 

X 

X 



X 

B 


X 


X 


X 

X 


X 

X 

X 


C 



X 


X 

X 

X 


X 


X 


D 

X 

X 


X 

X 


X 


X 

X 



E 

X 


X 


X 

X 


X 


X 


X 

F 



X 






X 


X 

X 

G 

X 

X 



X 


X 



X 

X 


H 

X 


X 


X 

X 





X 


I 

X 


X 

X 

X 


X 

X 


X 


X 

J 



X 



X 


X 

X 



X 

K 




X 

X 

X 

X 

X 

X 



X 

L 


X 

X 

X 

X 

X 

X 


X 

X 

X 



18. Suppose three candidates are to be elected. Voters have the following preference 
profile (if a candidate appears on a list, one may assume the voter approves of 
that candidate): 


3 

5 

7 

4 

8 

5 

A 

B 

C 

D 

G 

E 

E 

D 

F 

F 

D 

B 

D 

F 

D 

A 

E 

A 

B 

E 

E 

C 

F 




A 

B 

B 





G 

C 





E 

A 



(i) What is the result under approval voting? 

(ii) What is the result under the basic STV system? 

(iii) What is the result under dynamic STV? 

19. An election is to be held under the STV system and four candidates are to be 
elected. The preference table is 


1 

7 

4 

2 

4 

6 

A 

A 

B 

C 

D 

E 

B 

C 

D 

B 

B 

C 


D 

A 

A 

A 

D 


E 


D 


A 


B 


E 


B 


(i) What is the result under the basic STV system? 

(ii) What is the result under dynamic STV? 
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20. An election is to be held under the STV system and three candidates are to be 
elected. The preference table is 


7 

7 

4 

2 

4 

6 

A 

B 

C 

D 

B 

F 

B 

A 

B 

B 

A 

C 



D 

A 

E 

E 



A 

C 

D 




E 

E 




(i) What is the quota? 

(ii) What is the result under the basic STV system? 

(iii) What is the result under dynamic STV? 

10.3 Fair Elections 

We have seen several different electoral systems. Even in a small election, the same 
preferences can give rise to a different results depending on the system used. In this 
section we shall present an extreme case of how the method chosen might affect the 
outcome of an election in a realistic situation. This means that an electoral system 
can be chosen to favor one or another candidate. 

There are also many instances in which voters can misrepresent their preferences, 
manipulating an election to give a certain result. Some instances of this are given 
below and another example will be discussed in Section 10.4. 


Five Candidates, Five Winners 

Consider a political party convention at which five different voting schemes are 
adopted. Assume that there are 110 delegates to this national convention, at which 
five of the party members, denoted by A, B, C, D, and E , have been nominated as 
the party’s presidential candidate. Each delegate must rank all five candidates ac¬ 
cording to his or her choice. Although there are 5! =: 5x4x3x2x 1 = 120 
possible rankings, many fewer will appear in practice because electors typically split 
into blocs with similar rankings. Let’s assume that our 110 delegates submit only six 
different preference lists, as indicated in the following preference profile: 
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Number of delegates 


36 

24 

20 

18 

8 

4 

First choice 

A 

B 

C 

D 

E 

E 

Second choice 

D 

E 

B 

C 

B 

C 

Third choice 

E 

D 

E 

E 

D 

D 

Fourth choice 

C 

C 

D 

B 

C 

B 

Fifth choice 

B 

A 

A 

A 

A 

A 


The 36 delegates who most favor nominee A rank D second, E third, C fourth, 
and B fifth. Although A has the most first-place votes, he is actually ranked last 
by the other 74 delegates. The 12 electors who most favor nominee E split into two 
subgroups of 8 and 4 because they differ between B and C on their second and fourth 
rankings. 

We shall assume that our delegates must stick to these preference schedules 
throughout the following five voting agendas. That is, we will not allow any dele¬ 
gate to switch preference ordering to vote in a more strategic manner or because of 
new campaigning. 

We report the results when six popular voting methods are used. There are six 
different results. 

1. Majority. As one might expect with five candidates, there is no majority winner. 

2. Plurality. If the party were to elect its candidate by a simple plurality, nominee 

A would win with 36 first-place votes, in spite of the fact that A was favored 
by less than one-third of the electorate and was ranked dead last by the other 74 
delegates. 

3. Runoff. On the other hand, if the party decided that a runoff election should be 

held between the top two contenders (A and B ), who together received a majority 
of the first-place votes in the initial plurality ballot, then candidate B outranks A 
on 74 of the 110 preference schedules and is declared the winner in the runoff. 

4. Hare Method. Suppose the Hare method is used: a sequence of ballots is held and 

eliminates at each stage the nominee with the fewest first-place votes. The last 
to survive this process becomes the winning candidate. In our example, E , with 
only 12 first-place votes, is eliminated in the first round. E can then be deleted 
from the preference profile and all 110 delegates will vote again on successive 
votes. On the second ballot, the 12 delegates who most favored E earlier now 
vote for their second choices, that is, 8 for B and 4 for C; the number of first- 
place votes for the four remaining nominees is 

A B C D 
36, 32, 24, 18. 

Thus, D is eliminated. On the third ballot the 18 first-place votes for D are 
reassigned to C, their second choice, giving 






ABC 
36, 32, 42. 
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Now B is eliminated. On the final round, 74 of the 110 delegates favor C over 
A, and therefore C wins. 

5. Borda count. Given that they have the complete preference schedule for each 

delegate, the party might choose to use a straight Borda count to pick the winner. 
They assign five points to each first-place vote, four points for each second, three 
points for a third, two points for a fourth, and one point for a fifth. The scores 
are 

A: 254 = (5) (36) + (4)(0) + (3)(0) + (2)(0) + (1)(24 + 20+ 18 + 8 + 4), 
B: 312 = (5) (24) + (4) (20 + 8) + (3)(0) + (2)(18 + 4) + (1)(36), 

C: 324 = (5) (20) + (4)(18 + 4) + (3)(0) + (2)(36 + 24 + 8) + (1)(0), 

D: 382 = (5) (18) + (4)(36) + (3)(24 + 8 + 4) + (2)(20) + (1)(0), 

E: 378 = (5)(8 + 4) + (4) (24) + (3)(36 + 20 + 18) + (2)(0) + (1)(0). 

The highest total score of 382 is achieved by D , who then wins. A has the lowest 
score (254) and B the second lowest (312). 

6. Condorcet. With five candidates there is often no Condorcet winner. However, 

when we make the head-to-head comparisons, we see that E wins out over 

• A by a vote of 74 to 36, 

• B by a vote of 66 to 44, 

• C by a vote of 72 to 38, 

• D by a vote of 56 to 54. 

So there is a Condorcet winner, namely E. 

In summary, our political party has employed five different common voting pro¬ 
cedures and has come up with five different winning candidates. We see from 
this illustration that those with the power to select the voting method may well 
determine the outcome. 


Sequential Pairwise Voting 

In sequential pairwise voting, several candidates are paired in successive runoff elec¬ 
tions. There is an agenda (an ordered list of candidates). For example, if the agenda 
is A, B, C, D, ..., then the elections proceed as follows: 

1. A against B. 

2. Winner of A B against C. 

3. That winner against D. 
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Position in the agenda is very important. To see this, consider a four-candidate 
election with agenda A, B, C, D, in which all four candidates are equally likely to 
win. If repeated trials are made then we would expect the following results: 

• A wins first runoff in half the cases. 

• A wins second runoff in half those cases—a quarter overall. 

• A wins the third runoff in half those cases—one-eighth overall. 

So A has a 1 in 8 chance of winning. B also has a has 1 in 8 chance. However, 
C has a 1 in 4 chance, and D has a 1 in 2 chance. In this case being later in the list is 
very beneficial. 

Rather than elections, this model is often used for sporting tournaments (the re¬ 
sult of a match is used instead of the result of a runoff election). One often sees 
playoff rules like 

(i) Second and third place getters in preliminary competition play each other (“the 
playoff”). 

(ii) The winner of the playoff meets the leader from the preliminaries. 

In this case it is reasonable that the preliminary leader should get an advantage. How¬ 
ever, when the model is used in voting situations it is very subject to manipulation. 


Manipulating the Vote 

The term strategic voting means voting in a way that does not represent your actual 
preferences to change the result of the election. We would call the resulting ballot 
insincere. 

Suppose your favorite is candidate X. (We will call you an X supporter.) Then 
X would normally appear at the top of your preference list. But sometimes you can 
achieve X’s election by voting for another candidate in first place! This is most com¬ 
mon in runoff situations; you can ensure that your candidate does not have to face a 
difficult opponent. The following example illustrates this. 

Sample Problem 10.8. A runoff election has preference profile 


6 

2 

1 

5 

4 

A 

C 

C 

B 

D 

D 

B 

A 

A 

A 

C 

D 

D 

C 

B 

B 

A 

B 

D 

C 


Show that the supporters of C can change the result so that their candidate wins, 
hy the two voters in the second column changing their ballots by demoting their 
candidate. 
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Solution. Initially the first-place votes are A-6, B- 5, C-9, D- 4, so the runoff 
election will be between A and C, and A wins 15-9. The revised profile is 


6 

2 

7 

5 

4 

A 

B 

C 

B 

D 

D 

C 

A 

A 

A 

C 

D 

D 

C 

B 

B 

A 

B 

D 

C 


the first-place votes are A-6, B-l , C-7, Z)-4, so the runoff election is between 
A and C, and C wins 13-11. 

Even when you cannot ensure victory for your opponent, you may still be able to 
obtain a preferable result. For example, suppose you support candidate X ; you think 
candidate Y is acceptable, but hate candidate Z. Even if insincere voting cannot 
ensure victory for candidate X , you may be able to swing the election to Y rather 
than Z. 

Sample Problem 10.9. An election with four candidates and seven voters is to 
he decided hy the Hare system. The preference profile is 


2 

1 

2 

1 

2 

B 

B 

D 

C 

C 

A 

D 

C 

A 

D 

D 

A 

B 

B 

A 

C 

C 

A 

D 

B 


Show that one of the two voters with B, A, D, C can change the outcome to a 
more favorable one by insincere voting. 

Solution. First consider the result of sincere voting. Initially A is eliminated, 
having no first-place votes 


2 12 12 
B B D C C 

D D C B D 

C C B D B 

Next D is eliminated, leaving 


2 12 12 

B B C C C 

C C B B B 


The winner is C. 

Now suppose one voter changes his ballot from B, A, D, C to D, A, B, C . The 
profile is 
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1 

1 

1 

2 

1 

2 

B 

D 

B 

D 

C 

C 

A 

A 

D 

C 

A 

D 

D 

B 

A 

B 

B 

A 

C 

C 

C 

A 

D 

B 


Again A is first eliminated, leaving 

1 

1 

1 

2 

1 

2 

B 

D 

B 

D 

C 

C 

D 

B 

D 

C 

B 

D 

C 

C 

C 

B 

D 

B 

In the next round, B is eliminated 





1 

1 

1 

2 

1 

2 

D 

D 

D 

D 

C 

C 

C 

C 

C 

C 

D 

D 


The winner is D. This is a preferable outcome for the voter who switched. 
Practice Exercise. Consider a Hare system election with preference profile 


8 

6 

5 

P 

Q 

R 

Q 

R 

S 

R 

p 

P 

s 

s 

Q 


Show that P would win this election. Show that if one of the six supporters of Q 
changes her vote, she could ensure that R wins, even though a majority of voters 
still prefer Q to R. 

Of course, you do not always know exactly how the votes will go. Strategic voting 
is usually based on assumptions about the election. 


Amendments 

We now consider an important example of manipulation of pairwise sequential vot¬ 
ing. Suppose three voters on City Council have to decide whether to add a new sales 
tax. Initially, 

• A prefers the tax, 

• B prefers the tax, 
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• C prefers no tax, 
so a tax will be introduced. 

However, let’s assume A hates income taxes and will never vote for one. On the 
other hand, B prefers income tax to sales tax. Suppose C moves an amendment to 
change the tax to an income tax. 

We now have the following outcomes. 

Original motion: that a city sales tax of 5% be introduced. 

Amendment (moved by C): change “sales tax of 5%” to “income tax of 2% 

(We will assume the 2% income tax will provide the same total as the 5% sales 
tax.) 

In the vote on the amendment, both B and C will vote in favor, with A against, 
so the amendment is carried. So the motion becomes the following: a city income tax 
of 2% shall be introduced. In the vote on the new motion, both A and C are against, 
while B votes in favor; so the motion is lost and there is no tax. 

Exercises 10.3 

1. Consider the preference profile 


3 

1 

1 

1 

1 

1 

1 

W 

W 

X 

X 

Y 

Y 

z 

z 

Y 

Y 

z 

X 

X 

X 

Y 

Z 

Z 

Y 

Z 

W 

Y 

X 

X 

W 

W 

w 

Z 

W 


What is the result of an election using the following systems? 

(i) Majority system. 

(ii) Plurality system. 

(iii) Borda count. 

(iv) Hare system. 

(v) Condorcet method. 

(vi) Pairwise sequential method with agenda X, Y, Z, W. 

(vii) Pairwise sequential method with agenda W, X, Z, Y. 
(viii) Pairwise sequential method with agenda Y, Z, W, X. 

2. Consider the preference profile 
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2 

1 

1 

1 

2 

A 

A 

B 

C 

D 

C 

D 

D 

D 

C 

B 

C 

C 

B 

A 

D 

B 

A 

A 

B 


What is the result of an election using the following systems? 

(i) Majority system. 

(ii) Plurality system. 

(iii) Runoff system. 

(iv) Borda count. 

(v) Hare system. 

(vi) Condorcet method. 

3. A committee of 11 members needs to elect one representative from four candi¬ 
dates. They plan to use a Borda count. The preferences are 


4 

2 

2 

3 

A 

A 

B 

B 

C 

C 

C 

D 

D 

C 

A 

C 

B 

D 

D 

A 


(i) Who would win the election if all electors vote sincerely? 

(ii) Can the voters in the third column vote insincerely so as to change the result 
in their favor? If so, how? 

(iii) Can the voters in the fourth column vote insincerely so as to change the 
result in their favor? If so, how? 

4. Suppose an election with the following preference profile is decided by the Hare 
system. 


3 

6 

1 

8 

6 

A 

A 

B 

C 

D 

B 

C 

D 

D 

C 

C 

D 

A 

A 

B 

D 

B 

C 

B 

A 


(i) Who will win the election? 

(ii) Show that the seven supporters of B can achieve a preferred result if they 
exchange B and D on their ballots (that is, they vote as if their preference 
was D, B, A , C). 
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5. An election with preferences 


3 

2 

2 

2 

A 

B 

C 

D 

B 

D 

B 

C 

C 

C 

A 

B 

D 

A 

D 

A 


is to be decided by Borda count. Who will win? Show that the supporters 
of C can make sure their candidate wins by insincerely voting preference 
(C, A, D, B). 

6. Consider a runoff election with preferences 


4 

4 

2 

3 

4 

A 

A 

B 

B 

C 

B 

C 

A 

C 

B 

C 

B 

C 

A 

A 


(i) Who wins the election? 

(ii) Show that two supporters of A can change the result in favor of their candi¬ 
date by changing their preferences from (A, C, B) to (C, A, B). 

7. Consider the preferences 


24 

16 

19 

21 

11 

10 

A 

B 

C 

D 

E 

E 

B 

A 

D 

C 

C 

A 

E 

D 

E 

E 

B 

A 

D 

E 

A 

B 

A 

B 

C 

C 

B 

A 

D 

D 


(i) Suppose everyone votes sincerely. Under the Hare system, who wins the 
election? 

(ii) Show that, if one of the supporters of D votes insincerely by reversing her 
first two preferences, she achieves a preferable outcome. 

8. An election with preference profile 


6 

4 

4 

A 

B 

C 

B 

C 

B 

C 

A 

A 


is decided by Borda count. How many of A’s supporters must change their bal¬ 
lots to (A, C, B)to make their candidate the winner? 
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9. An 18-member committee is to elect one of the four candidates Q, R, S,T. Their 
preference table is as shown. Which candidate wins under pairwise sequential 
voting with agenda (S, T, Q, R)1 


4 

6 

5 

3 

Q 

R 

s 

T 

T 

S 

R 

S 

s 

T 

T 

R 

R 

Q 

Q 

Q 


10. Consider a pairwise sequential election with preferences 


5 

3 

2 

1 

8 

X 

X 

Y 

Z 

T 

Y 

T 

Z 

Y 

Y 

T 

Y 

X 

X 

X 

Z 

Z 

T 

T 

Z 


(i) Who will win under agenda X, Y , Z, T1 

(ii) Who will win under agenda Y, T, Z, X ? 

11. A pairwise sequential election has preference profile 


2 

8 

11 

1 

A 

C 

B 

A 

B 

B 

A 

C 

C 

A 

C 

B 


(i) Who will win under agenda A, B, Cl 

(ii) Who will win under agenda B, C, A? 

(iii) Who will win under agenda C, A, B1 

12. A pairwise sequential election has preference profile 


40 

32 

12 

18 

P 

Q 

R 


Q 

R 

S 

R 

R 

P 

P 

Q 

S 

s 

Q 

P 


(i) Who will win under agenda P, Q, R, S? 

(ii) Who will win under agenda S , R , Q, PI 

(iii) Who will win under agenda S , P, R , Q1 

(iv) Who will win under agenda S , Q, P, R1 
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13 . A club with 46 members wishes to elect a president. The post is currently held 
by Y. The 46 members have preference profile 


16 

12 

10 

8 

X 

Y 

z 

Y 

z 

Z 

z 

X 

Y 

X 

Y 

Z 


(i) The following electoral system is used: a plurality vote is held between the 
candidates other than the President; there is then a majority election between 
the winner of that election and the current President. Show that X will win 
the election. 

(ii) The eight voters whose votes form the right-hand column of the table decide 
to change their votes in an attempt to make Y the winner. Can they do this? 

14 . Suppose X is a Condorcet winner of an election. Can there be an agenda such 
that X does not win under pairwise sequential voting? 

15 . Three committee members must vote to fill one position. There are four candi¬ 
dates, and the preferences are 


ABC 
BCD 
C D A 
DAB 


They decide to use pairwise sequential voting. For each candidate, find an agenda 
such that the selected candidate will win. 

16 . An election is to be held under plurality voting. The preferences profile is 


1 

3 

2 

A 

C 

B 

C 

B 

A 

B 

A 

C 


Can the supporter of A (first column) achieve a preferable result by insincere 
voting? 


10.4 Properties of Electoral Systems 

In this section we shall discuss various properties of electoral systems, most of which 
are related to the accuracy or fairness of the systems. For simplicity we shall restrict 
our attention to elections with a single winner, but most of the ideas can be extended 
to multiple winners, such as the approval and generalized Hare methods. 
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It will be convenient to define an instance of an electoral system to mean an 
election that uses the system on a particular preference profile. 


Polls 

Often an election is preceded by an informal count, or poll. For example, even in 
2007 there were several polls for the 2008 Presidential election. 

If a candidate does badly in polls, his or her supporters may change their votes. 
For example, consider an election that uses plurality voting. There are 3900 voters 
and three candidates, A, B,C. The voters’ actual preferences are 


1800 

1700 

400 

A 

B 

C 

C 

C 

B 

B 

A 

A 


If the election were held immediately, A would win. 

However, suppose a poll is held. As usual, the poll uses the same system (plurar- 
ity voting). Say there are 35 voters in the poll and their preferences are proportional 
to the overall preferences. The result will show A first (18 votes), with B a close 
second (17 votes), and C a distant third (with only 4 votes). 

When the poll results are reported, the 400 voters who favor C may well reason, 
“C cannot win the election, so it would be preferable to elect the better of the other 
two.” They all prefer B to A, so they would vote for B. Suppose 300 of the voters 
decided to change their votes. The vote will be A-1800, B- 2000, C-100, and B will 
win the election. 

But observe that C was a Condorcet winner under the original preferences 

B beats A 2200-1700, 

C beats B 2200-1700, 

C beats A 2100-1800. 

The type of behavior exhibited by C’s followers is frequently observed after polls 
and leads to the following. 

Poll Assumption 

• Voters whose favorite is not one of the two top candidates in a poll will adjust 
their preferences to vote for one of the top two. 

For this reason, those who do badly in polls may drop out before the election. 
In the example, C might easily have dropped out. The Poll Assumption applies pri¬ 
marily to the last poll taken before an election, but candidates frequently drop after 
earlier polls because they do not expect their showing to improve later. 
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If the poll assumption holds, 

• A Condorcet winner will always lose if s/he is not one of the top two candidates 
in the poll. 

• On the other hand, If the poll assumption holds, a Condorcet winner will always 
win if s/he is one of the top two candidates in the poll. 

We say an election satisfies the poll fairness criterion if there is no case where 
the result is changed by a candidate who is not one of the top two drops out before 
the final vote; a system satisfies the poll fairness criterion if every instance satisfies 
the condition. 

The preceding discussion indicates that the candidate who would eventually win 
an election may very well withdraw for what seemed like good reason. Even when 
the candidate does not withdraw, there is no guarantee that an electoral system will 
behave as we expect. The only completely satisfactory method of voting is majority 
rule, and even that is not guaranteed to give a result when there are more than two 
candidates. No matter what system is chosen, there will always be some preference 
profiles for which the method is inappropriate; either it produces no result (even 
when there are no ties) or it has a property that is clearly undesirable. The rest of this 
chapter is devoted to illustrating this fact. 


The Condorcet Winner Condition 

The Condorcet winner condition (or CWC) says 
“If the following can happen: 

• the election has a winner; 

• the election has a Condorcet winner; 

• the two are not the same; 

then the method of election is unfair.” 

This seems a reasonable thing to say, and you might well require that, if a prefer¬ 
ence profile has a Condorcet winner, your electoral system must not elect a different 
candidate. However, many systems do not follow this principle. For example, con¬ 
sider the 3900 voter election, given earlier in this section. The preferences were 


1800 

1700 

400 

A 

B 

C 

C 

C 

B 

B 

A 

A 
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we observed that A would win the election under the plurality method, but C is a 
Condorcet winner. Furthermore, if the runoff method were used, B would receive all 
400 preferences from C, and would win. 

An election in which there is a Condorcet winner is said to satisfy the Condorcet 
criterion if the Condorcet winner wins the election, or (trivially) if there is no winner 
or there is no Condorcet winner. An electoral system satisfies the criterion if every 
instance of it satisfies the criterion. So we have observed that neither plurality nor 
runoff satisfy this criterion. 


Independence of Irrelevant Alternatives 

Independence of Irrelevant Alternatives (or IIA) says, 

Suppose candidate A wins. Suppose the voters then change votes, and now 
B wins. There must be at least one voter who previously preferred A to B 
and now prefers B to A. 

This seems reasonable. But consider the following example. 

Sample Problem 10.10. A Borda count election has preference table 


3 

2 

2 

1 

A 

B 

B 

C 

B 

C 

A 

A 

C 

A 

C 

B 


What is the result? What is the effect if the voters in the left-hand column change 
their preference to A, C, B? 

Solution. Using a 2-1-0 count, A receives 9 points, B receives 11, and C re¬ 
ceives 4, so B wins. After the votes are changed, the scores are A-9, B- 8 , and 
C-7, and A wins. 

In the example, no change was made in the relative positions of A and B , but A 
won the election rather than B. So Borda count does not always satisfy IIA. 

If a voting system satisfies both CWC and IIA, then it might not produce a win¬ 
ner. To see this we consider an example. We do not need to know what electoral 
system is used, we only need to know that it satisfies both CWC and IIA. 

Consider an election with three voters that produces the following preference 
table: 


4 

3 

2 

A 

B 

C 

B 

C 

A 

C 

A 

B 
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This election has no Condorcet winner. In head-to-head contests, A beats B 6-3, B 
beats C 7-2, and C beats A 5-4. 

Now suppose the second group of voters all change their votes by putting C 
above B —the relative positions of A and C are unchanged. The preferences become 


4 

3 

2 

A 

C 

C 

B 

B 

A 

C 

A 

B 


In this case C is a Condorcet winner (C beats B 5-4, and the other results are 
unchanged). By CWC, if the election has a winner, it must be C. So A cannot win 
this new election. Therefore, IIA tells us, A could not have won the original election. 

However, suppose we instead considered interchanging A and C in the third 
group’s preferences. A is now a Condorcet winner (beating C 6-3), and the same 
reasoning as before shows that B did not win the original. 

And if we consider interchanging A and B in the first voter’s preferences, we can 
show in the same way that C did not win the original. 

We have shown that, if a voting system satisfies both CWC and IIA, it cannot 
produce a winner, even for the simple preference profile we have given. 


Monotonicity 

Suppose an election is held and A wins. The only change made in the preference lists 
is to move A higher (nearer to first preference) in some lists. Surely A will win the 
new election? If this is true we say the election is monotone , or say it satisfies the 
monotonicity criterion. A system is called monotone if every instance is monotone. 

Not all systems are monotone. In particular, the runoff system and Hare system 
are not. As an example, consider a runoff election with preference profile 


6 

1 

1 

5 

2 

B 

A 

A 

C 

C 

A 

C 

B 

B 

A 

C 

B 

C 

A 

B 


If the preferences are followed, B will be eliminated, and A defeats C easily, 14-7. 
However, suppose the supporters of A convince the two electors whose ballots read 
C, A, B to move A to the top of their ballots. Then C is eliminated, and in the runoff 
B beats A, 11-10. 
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Pareto Condition 

Vitello Pareto (1848-1923), a French-Italian economist and sociologist, proposed 
the following: if everybody prefers candidate A to candidate B , then B should not be 
elected. This Pareto condition is not satisfied by sequential pairwise voting. Here is 
an example. Suppose three voters have preferences 


A 

C 

D 

B 

D 

A 

C 

A 

B 

D 

B 

C 


If the agenda D, A, C, B is used, then D beats A 2-1, C beats D 2-1, and B beats 
C 2-1, so B is elected. But every voter prefers A to B. 

Sample Problem 10.11. Suppose a sequential pairwise election does not satisfy 
the Pareto condition. Show that there are at least four candidates. 

Solution. Suppose there were an example with three candidates, A, B , and C. 
Suppose every voter prefers A to C, but C wins the election. The last comparison 
made must involve C (in order for C to win) and B (C could not beat A). If B 
is the last member in the agenda, then the first comparison is between A and C, 
and C would be eliminated. So the agenda must be A, B,C (or, equivalently, 
B, A, C); in the first comparison, B beats A, and in the second, C beats B. 

Suppose the preference profile is 


X 

y 

z 

A 

A 

B 

B 

C 

A 

C 

B 

C 


for some nonnegative integers x,y,z. (No other columns are possible because A 
cannot be below C.) Since B beats A, z > x + y. But C beats B, so y > x + z. 
So y > 2x + y, and x must be negative—a contradiction. 


Arrow’s Impossibility Theorem 

A dictator is a person who completely controls a country or an organization. By 
analogy, we define a dictator in a voting system to be a voter with similar power: the 
dictator’s first preference will always be elected. A voting system that has a dictator 
is called a dictatorship. 

If there is a dictator, there is no point in having elections or collecting preference 
lists. However, a dictatorship obviously has some of the properties that we would 
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think desirable. It always produces a result. It is monotone. It satisfies the Pareto 
condition. And it satisfies the independence of irrelevant alternatives. 

Surprisingly, the American economist Kenneth Arrow (b. 1927) proved that a 
dictatorship is the only electoral system for three or more candidates that has all 
these properties. 

Theorem 62 (Arrow’s Impossibility Theorem). For three or more candidates there 
is no electoral system that always produces a winner (rexcept in the case of ties), 
satisfies the Pareto condition, satisfies the independence of irrelevant alternatives, 
and is not a dictatorship. 

(Notice that we do not even need to add monotonicity.) 

Arrow was awarded a Nobel Prize for Economics in 1972. 

Exercises 10.4 

1. A plurality-vote election has preference table 


8 

6 

4 

2 

X 

Y 

Z 

T 

T 

T 

T 

Z 

Y 

X 

Y 

Y 

Z 

Z 

X 

X 


(i) Is there a Condorcet winner? If so, who? 

(ii) Suppose a poll is held and those who place third and fourth drop out. Who 
will win the final election? 

(iii) Suppose a poll is held but only the candidate who placed fourth drops out. 
Who will win the final election? 

2. Fifteen club members are voting to elect a president, using a Borda count. The 
preference profile is 


6 

5 

4 

A 

B 

C 

C 

D 

B 

B 

A 

A 

D 

C 

E 

E 

E 

D 


A poll is conducted; both C and D realize that they are unlikely to win. 

(i) Who would win this election? 

(ii) Who would win if C drops out before the election? 
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(iii) Who would win if D drops out before the election? 

3. A hiring committee uses the Hare system to select a new foreman. The prefer¬ 
ence profile is 


5 

4 

3 

A 

B 

C 

D 

A 

D 

B 

D 

B 

C 

C 

A 


(i) Who would win this election? 

(ii) Who would win if C drops out before the election? 

(iii) Who would win if A drops out before the election? 

4 . Devise a preference profile for three candidates A, B, C with 13 voters, in which 
C wins under plurality but both A and B would beat C 8-5 in runoff elections. 

5. Consider a plurality vote with preference profile 


5 

2 

4 

A 

B 

C 

C 

C 

B 

B 

A 

D 

D 

D 

A 


(i) Who would win this election? 

(ii) Is there a Condorcet winner? If so, who? 

(iii) Does this election satisfy the Condorcet criterion? 

(iv) Suppose a poll is conducted, and C decides to drop out before the election. 
Who would win? 

(v) Does this election satisfy the Condorcet criterion? 

(vi) Does this election satisfy the poll fairness criterion? 

(vii) Does this election satisfy the monotonicity criterion? 

6. A runoff election has preference profile 


13 

9 

5 

11 

X 

Y 

Y 

z 

Y 

Z 

X 

X 

Z 

X 

Z 

Y 


(i) Who would win this election? 

(ii) Is there a Condorcet winner? 
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(iii) Does this election satisfy the Condorcet criterion? 

(iv) Does this election satisfy the monotonicity criterion? 

7. Consider the preference profile 


4 

3 

2 

A 

B 

C 

B 

C 

B 

C 

A 

A 


(i) Who would win this election under the plurality method? 

(ii) By manipulating the third column, show that this election does not satisfy 
independence of irrelevant alternatives. 

8. Consider the preference profile 


4 

3 

2 

A 

B 

C 

B 

C 

A 

C 

A 

B 


(i) Who would win this election under the runoff method? 

(ii) By manipulating the second column, show that this election does not satisfy 
independence of irrelevant alternatives. 

9. Suppose the preference profile 


1 

5 

4 

2 

A 

B 

C 

D 

D 

A 

B 

C 

C 

C 

D 

A 

B 

D 

A 

B 


is used in a Hare system election. 

(i) Show that B would win the election. What is the result of the runoff? 

(ii) Suppose the voters in the final column move A above C. Show that A now 
loses. What is the result? 

10. A Hare election uses the preference profile 


7 

6 

5 

3 

A 

B 

C 

D 

B 

A 

B 

C 

C 

C 

A 

B 

D 

D 

D 

A 
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Show that A wins the election. Consider the result if the voter who preferred D 
moved A up to first preference. Does this prove that this election is not mono¬ 
tone? 

11. Consider the preference profile, in which every voter prefers A to B. 


200 

200 

200 

150 

A 

C 

D 

A 

B 

A 

C 

D 

D 

B 

A 

C 

C 

D 

B 

B 


Find an agenda such that B would win a pairwise sequential election. 

12. Consider a plurality election in which one candidate has an absolute majority 
(first-place in more than half the preference lists). Show that this election satisfies 
the Condorcet and monotonicity conditions. 

13. Does the Condorcet system satisfy the Condorcet criterion? 

14. Prove that the Condorcet system is monotone. 

15. Show that a dictatorship satisfies the Pareto condition. 

16. Prove that the Borda system is monotone. 





Solutions to Practice Exercises 


Chapter 1 


1.1 {x : x 3 — 5x 2 + 1 lx + 6 = 0}, the set of the first three positive whole numbers 
and {1,2, 3} are some answers. 

1.2 72 = 2 3 • 3 2 , 84 = 2 2 • 3 • 7, (72, 84) = 2 2 • 3 = 12. 56 = 2 3 • 7,42 = 2 • 3 • 7, 
(56,42) = 2 • 7 = 14. 

1.3 1,0, l,x 3 , -0.5,0.04. 

1.4 r -2 /f -3 = t 3 /t 2 = t; y 5 ~ 2 = y 3 -, (4x~ 2 )(3x 4 ) = 12x 4 ~ 2 = 12x 2 . 

1.5 3 2 = 9 so log 3 9 = 2; 5 3 = 125 so log 5 125 = 3; Vi = 2 so log 4 2 = ±. 

1.6 9^3 = 3 2 • 32 so log 3 9V3 = | and log 3 (9\/3) 5 = 5 log 3 9\/3 = yor 12.5. 

1.7 2,-2, 5, -4, 3.1, 4.4. 

1.8 J2l=3 *'0‘- 1) = 3 • 2 + 4 • 3 + 5 • 4 = 6 + 12 + 20 = 38; £f =2 i = 
2 + 3+ 4 + 5 + 6 = 20. 

1.9 l+3 + 5 + 7 + 9 = Ef= i 2; - 1; 8 + 27 + 64 + 125 = J2l=2 * 3 - 

1.11 3+7 + ---+43 = E«=i 4/ - 1 =4-£, 1 i 1 (-ll = 4-+±-11 = 264-11 = 
253. 


1.12 (144)5 = 5 2 +4-5+4 = 49, (203) 7 = 2-7 2 + 3 = 101, (112) 3 = 3 2 + 3 + 2 = 
14. 


1.13 (0.242)5 = 2-5- 1 +4-5“ 2 + 2-5“ 3 = 0.4+ 0.16+ 0.016 = 0.576so (using 
the preceding answer) (144.242)5 = 49.576. 


1.14 


5 54 

5 10 + 454 = (204)5 
2 + 0 


6 103 

6 17 + 1 103 = (251) 6 . 

2 + 5 


W.D. Wallis, A Beginner's Guide to Discrete Mathematics , 

DOI 10.1007/978-0-8176-8286-6, © Springer Science+Business Media, LLC 2012 
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1.15 

2 • (0.40625) = 0.8125, first digit 0, 

2 • (0.8125) = 1.625, second digit 1, 
2 - (0.625) = 1.25, third digit 1, 

2 • (0.25) = 0.5, fourth digit 0, 

2- (0.5) = 1.0, fifth digit 1. 

So 0.40625 = (0.01101) 2 = 0.011015. 

1.16 53 = (125)6. 


6- 

(0.12) = 0.72, 

first digit 0, 

6- 

(0.72) = 4.32, 

second digit 4, 

6- 

(0.32) = 1.92, 

third digit 1, 

6- 

(0.92) = 5.52, 

fourth digit 5, 

6- 

(0.52) = 3.12, 

fifth digit 3, 

6- 

(0.12) = 0.72, 

sixth digit 0.. 


The whole process recurs. So 0.12 = 0.(04153)6, 53.12 = (125.04153)6. 

1.17 553.76 = 010110110011.01110110 = 10110110011.0111011. 

1.18 1011011011.01 = 011011011011.0100 = 6D5.4. 

1.19 1.750 x 10 1 , -1.112 x 10 1 , 1.401 x 10°. 

1.20 (1) -8.050 x 10 2 , (2) 1.088 x 10 2 , (3) 1.144 x 10 3 , (4) 3.077 x KT 1 . 

1.21 1.043 x 10 2 + 3.223 x 10 3 = (0.1043 + 3.223) x 10 3 = 3.327 x 10 3 . 

1.22 6.041 x 10 2 + 3.303 x 10 3 = (0.6041 + 3.303) x 10 3 = 3.907 x 10 3 ; 7.007 x 
10“ 4 +4.644 x 10“ 4 = (7.007 + 4.644) x 10“ 4 = 11.651 x 10“ 4 = 1.165 x 
10" 3 . 

1.23 (4.640 x 10 2 ) x (3.020 x 10 4 ) = (4.640 x 3.020) x 10 6 = 14.0128 x 10 6 = 
1.401 x 10 7 in normalized form, noting that the problem was set in a length 4 
system. 

1.24 46 = 101110 2 . Since it is positive, the representation is just 00101110. 

Since —51 is negative, the two’s complement is used. 51 = 110011 2 , which, 
completed to eight places, is 00110011. The one’s complement is 11001100, 
so the two’s complement is 11001101. 

1.25 As 01111001 has first digit 0, it represents a positive number. 

1111001 2 = 2 6 + 2 5 + 2 4 + 2 3 + 1 
= 64 + 32+ 16 + 8 + 1 
= 121 . 
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Since its first digit is 1, 10101101 represents a negative number. To compute it, 
reverse the construction process: subtract 1 to get 10101100 and then comple¬ 
ment to get 01010011. So the number represented is —101001 12 and 

1010011 2 = 2 6 + 2 4 + 2 + 1 
= 64+16 + 2+1 
= 83. 

So the strings represent 121 and —83. 

1.26 8 = IOOO 2 and 4 = IOO 2 . So the representation of 8 is 00001000 and that 
of 4 is 00000100. Thus 4 has complement 11111011, and two’s complement 
11111011 + 1 = 11111100; and —4 is represented by 11111100. Now add 

00001000 8 

11111100 -4. 

100000100 “T 

The first digit (carry digit ) is ignored, as it takes us outside the eight-digit block 
available for integers. So the answer is represented by 00000100, which equals 
IOO 2 or 4, as expected. 

1.27 12.25: First, 12 = IIOO 2 . Next convert 0.25 to binary. 

2 x 0.25 = 0.5, 

2 x 0.5 = 1. 

So the binary form of 12.25 is IIOO.OI 2 , or 1.10001 x 2 3 . The exponent is 3, 
which has excess 127 form 130 = 100000IO 2 . So the exponent is stored as 
10000010. The sign bit is 0. So the IEEE15A form is 

0100 0001 0100 0100 0000 0000 0000 0000. 

—0.3: Converting 0.3 to binary, we get 

2 x 0.3 = 0.6, 

2 x 0.6 = 1.2, 

2 x 0.2 = 0.4, 

2 x 0.4 = 0.8, 

2 x 0.8 = 1.6, 

2 x 0.6 = 1.2, 

and the expression now repeats: 

0.3 = 0.0T00T 


= 1.0011 X 2" 2 . 
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The sign bit is 1. The exponent is —2, which has excess 127 form 125 = 
IIIIIOI 2 , or 01111101. So the IEEE15A expression is 

1011 1110 1001 1001 1001 1001 1001 1001 . 

112.5: 112 = IIIOOOO 2 and 0.5 = 0.1 2 , so in base 2, 112.5 = 1110000.1 = 
1.1100001 x 2 6 . The sign bit is 0. The exponent is 6, which has excess 127 
form 133 = IOOOOIOI 2 . So the IEEE154 expression is 

0100 0010 1110 0001 0000 0000 0000 0000. 

1.28 Converting 0.195625 to binary, we get 

2 x 0.195625 = 0.39125, 

2 x 0.39125 = 0.7825, 

2 x 0.7825 = 1.565, 

2 x 0.565 = 1.125, 

2 x 0.125 = 0.25, 

2 x 0.25 = 0.5, 

2 x 0.5 = 1, 

soO. 195625 = O.OOIIOOI 2 = I.IOOI 2 x2 -3 . The exponent — 3 has excess 127 
form 01111100, so the IEEE15A expression is 

0011 1110 0100 1000 0000 0000 0000 0000, 

or, in hexadecimal form, 3£480000. 


Chapter 2 

2.1 means “the wind is not blowing,” p V q means “the sun is shining or the 
wind is blowing (maybe both),” and A q is “the sun is not shining but the 
wind is blowing.” 

2.2 p A r A~q means “Joseph and Donna are here but Nancy is not;” q V r means 
“either Nancy or Donna is here (maybe both)” (or “at least one of Nancy and 
Donna is here”). 

2.3 For p A^q to be true, the card must be a heart but not an honor—that is, the 
draw is the heart 2, 3, 4, 5, 6, 7, 8, or 9. p v~q is true when the card is either a 
heart or is not an honor, so the draw is one of the minor cards—any 2, 3, 4, 5, 
6, 7, 8, or 9—or the heart ace, king, queen, jack, or ten, a total of 37 draws. 


Solutions to Practice Exercises 


383 


2.4 It is convenient to write s = (~p A ~r). 


p 

q 

r 

~P 


s 

(q v s) 

(pv (qv s ) 

T 

T 

T 

F 

F 

F 

T 

T 

T 

T 

F 

F 

T 

F 

T 

T 

T 

F 

T 

F 

F 

F 

F 

T 

T 

F 

F 

F 

T 

F 

F 

T 

F 

T 

T 

T 

F 

F 

T 

T 

F 

T 

F 

T 

T 

T 

T 

T 

F 

F 

T 

T 

F 

F 

F 

F 

F 

F 

F 

T 

T 

T 

T 

T 


2.5 


p q _ (p v~g) (g -> />) _ (p v~<?) (<? -»• p) 

T T F T T T 

T F T T T T 

F T F F F T 

F F T T T T 


2.6 _ 

P q _ py_q (pAq) ~(p v g) _ (p A g) A ~(p v g) 

T T T T F F 

T F T F F F 

F T T F F F 

F F F F T F 

2.7 {x, y, z), {x, y], {x, z], {y, z], {x}, {y}, {z}, and 0. 

2.8 Z\Z+ = {0, -1, -2,...,}, Z n Z+ = Z+, (Z+\E) U n = {odd positive 
integers) U {2}. 

2.10 Suppose x e R n (S U T). Then x e R and x e (S U T ), so either x e S, 
and x e R, so x e R fl 5); or x e T, and x e R, so x e R IT T. Either have 
x e (R n S) or x e (R IT T), so x e (R (T S) U (R n T). So 

x€RH(SUT) => x e (RD S)U (RDT). 

Conversely, suppose x e (R n .S') U (R n T). If x ^ R, then certainly x ^ 
R D S, and x ^ R fl T. So we can be sure x e R. If x ^ (S U T ), then x £ S, so 
x ^ (R fl S ), and x ^ 7, so x ^ (R fl T); but x must belong to one or the other 
of these sets, so x e (S U T ). So we’ve shown x € R fl (S U T). So 

x e (R n S) U (R (T T) => xeRn(SUT). 

Together these prove R (T (S U T) = (R fl S) U (R (T T). 
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2.12 


A 

B 

C 

AUC 

a n b nc 

£ D (A U C) 

T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

T 

T 

F 

F 

T 

F 

F 

T 

F 

F 

F 

T 

T 

T 

F 

T 

F 

T 

F 

F 

F 

F 

F 

F 

T 

T 

F 

F 

F 

F 

F 

F 

F 

F 


There is no line with T in the second-last column and F in the last. 


2.14 




2.15 The argument is not valid: the sets might be as follows: 



2.16 



2.17 {(1, 1), (1.4), (1, 5), (2, 1), (2,4), (2, 5)}. 

2.18 The case n = 1 is 1 = l 2 , which is obviously true, so the formula gives the 
correct answer when n = 1. Suppose it is true when n = k: therefore 

1 + 3 + • • • + (2k — 1) = k 2 . 
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Then 


1 + 3 + • • • + (2k — 1) + (2k + 1) — k 2 + (2k + 1) 

= (k + l) 2 , 


and the formula is proven correct when n = k + 1. So, by induction, we have 
the required result. 

2.20 Suppose the proposition P(n) means n 2 > 2n + 1. Then P( 3) means 9 > 7, 
which is true. Now suppose k is an integer greater than or equal to 3, and P(k) 
is true: k 2 > 2k + 1. Then (k + l ) 2 = k 2 + 2k + 1 > (2k + 1) + (2k + 1) 
(assuming P(k )) = 4£ + 2, and when k > 3, obviously \k + 2 > 2 (k + 1) + 1 
(in fact, this is true for every positive integer k). So P(k + 1) is true. Therefore 
P(k) implies P(k + 1), and the result follows by induction. 

2.21 Suppose P(n) means 7 divides 3 2n —2 n . Then R(l) is true because 3 2 —2 1 = 7. 
Now suppose k is any positive integer, and P(k ) is true: say 3 2k — 2 k = lx , 
where x is some integer. Then 3 2k + 2 — 2 k+l = 9 • 3 2k — 2 • 2 k = 7 • 3 2k + 2 • 
3 2k — 2 • 2 k = 1 • 3 2 ^ + 2 • lx, which is divisible by 7. So the result follows by 
induction. 

2.22 fn+4 = fn+3 + fn+2 = (fn+2 + fn+1 ) + fn+2 — 2/^+2 + /rc +1 = 2(f n +\ + 

/n) + A+l = 3/ n+ i + / w . So / n+4 is a multiple of 3 if and only if f n is. If 
P(n) means f^ n is divisible by 3, then .P(l) is true (P( 1) means f\ is divisible 
by 3; in fact, f\ = 3), and P(k) true implies P(k + 1) true, so P(n) is true for 
all positive integers, by induction. If Q(n) means fan-i is not divisible by 3, 
then ,P(1) is true (2(1) means /3 is not divisible by 3; in fact, /3 = 2), and 
Q(k) true implies Q(k + 1) true, so Q(n) is true for all positive integers, by 
induction. Similarly, we may prove that fAn-2 and fa n - 1 are not divisible by 3. 
The four statements together give the required answer. 


Chapter 3 

3.1 See the laws of logic in Section 2.1. Rules Bl, B2, B3, and B4 for Boolean 
algebras are the commutative and associative laws for v and A, the distributive 
laws, and the identity laws, respectively. We noted in that section that p p is 
a contradiction, and it is also easy to see that p v~p is a tautology; these give 


B5. 


3.2 xy'z'x. 

3.3 x = x 1 

= x(x+x') 
= XX + xx' 
— xx + 0 

= XX 


by B4(b) 
by B5(a) 
by B3(b) 
by B5(b) 
by B4(a). 
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3.4 We prove the first law. The second follows by duality. 


x + xy = xl + xy 

= x(y + y') + xy 
= xy + xy' + xy 
= xy + xy + xy' 

= *(y + y) +*y' 

= xy + xy' 

= x(y + y’) 

= X 1 
= X 


by B4(b) 
by B5(a) 
by B3(b) 
by B 1(a) 
by B5(a) 

by Theorem 12(a) 
by B3(b) 
by B5(a) 
by B4(b). 


3.5 yz + yz = y(z + z) (by the Distributive law) = yz (by the Absorption law 
applied to z + z). 

3.6 0(1, 0, 1) = 10 + Oil' = 10 + 010 = 0 + 0 = 0; 0(1, l, 0) = 11 + 101' = 

1 + 0 = 1;0(O, 1, 1 ) = 01 + 110' = 01 + 111 = 0+1 = 1 . 

3.7 xy'x'z' = xx'y'z! (commutative) = OyV = 0; x'yz'y = x'yyz' (commutative) 
= x'yz' which is prime. 

3.8 

Step 1: (x'y)'(x' + xyz') = (x + y')(x' + xyz') by de Morgan’s laws. 

Step 2: (x + y')(x' + xyz') = x(x' + xyz') + y'(x' + xyz') = xx' + xxyz' + 

y'x' + y’xyz !. 

Step 3: xx 7 = 0; xxyz! — xyz!\ y’x’ is fundamental; y'xyz' = y'yxz' = 0. So 
the expression equals xyz' + x'y'. 

Step 4: Neither term includes the other, so we are finished. 

3.9 xy = xy 1 = xy(z + z') = xyz + xyz', so 

xy + 0 = xy + xyz + xy'z + xyz' + x'y'z' 

= xy + xyz + xyz' + xy'z + x'y'z' 

= xyz + xyz' + xy'z + x'y'z' 

= P- 

The only fundamental products included in xy are x and y. To show that 
x + 0 ^ 0, it is sufficient to observe that if x = 1, y = z = 0, we have 
x + 0 = 1,0 = 0. For the form y, the substitution x = 0, y = z = 1 gives 
y + 0 = 1,0=0. 

3.10 xy' + xz! + yz = xy'(z + z') + x(y + y')z' + (x + x')yz = xy'z + xy'z' + 
xyz! + xy’z! + xyz + x’yz = xy’z + xy’z! + xyz' + xyz + x'yz . 



y 

y’ 


y 

y' 

X 

i 


X 


1 

x’ 



x' 

i 
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3.12 The maps for the two forms are 



y 

y' 


y 

y' 

X 

CD 


and x 

(i 

i) 

x' 



x' 




respectively. The basic rectangles are marked. There is no change in the first 
expression: xy + x'y' is already minimal. The second map gives the minimal 
expression xy + xy r = x, as expected. 

3.16 An appropriate circuit is 


x 

y 

z 


x'+ xz 


3.17 In Sample Problem 3.14 we saw that the given expression has minimal form 
yz r + y’z. A suitable circuit is 




yz'+y'z 


Chapter 4 

4.1 s < t corresponds to {(1, 1), (1, 2), (1, 3), (1, 4), (2, 2), (2, 3), (2, 4), (3, 3), 
(3, 4)}. j > t corresponds to {(1, 1), (2, 1), (2, 2), (3, 1), (3, 2), (3, 3)}. 

4 2 

* pz is not defined. 

tor = {(1,6), (2, 6), (2, 7)}. 

P U a = {(3, 6), (3, 7), (4, 6), (4, 7), (5, 7)}. 

4.3 {6, 3), (7, 3), (7, 5)}. 

4.4 Both = and > are reflexive. = is symmetric while > is antisymmetric. Both are 
transitive. 

4.5 Every sum of two members of A is again a member of A, except that a + b 
is not in A whenever a = \/2 or b = V2. So a 8 b is true if and only if both 
a and b are in Z. That is, 8 = Z x Z. The relation 8 is neither reflexive nor 
irreflexive because 1 8 1 is true and \/2 8 \fl is false. The relation is symmetric 
and transitive. 
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4.6 The relation is obviously reflexive. To say “A has the same volume as and 
“B has the same volume as A” mean the same thing so the relation is symmet¬ 
ric. If A has the same volume as B and B has the same volume as C, then all 
three solids have the same volume, so A a B and B a C together imply A a C, 
and the relation is transitive. So o is an equivalence relation. 

4.7 The classes are [a] = {a, —a}. (Notice that [0] has only one element.) There 
are infinitely many of them, one for each nonnegative integer. 

4.8 If alb and b l a are both true, then a \ b and b \ a are both true, so (as pointed 
out in Sample Problem 4.8) a = b, which contradicts a lb. So us asymmetric. 
Therefore it is antisymmetric. Transitivity follows from the transitivity of |. So 
? is a partial order relation. Since a l a never holds, it is strong. As 2 ? 3 and 3 ? 2 
are both false, 2 and 3 are not comparable, so it is not total. 

/l(/lO)) = fl(x ) SO /l(/l) = /l- 

h{U(xj) = f 2 ( 1/(1 -X)) = 1/[1/(1 -X)] = l-x = Mx) 

SO /2(/4) = h- 

fsifeW) = f 5 ((x - l)/x) = ((x - l)/x)/[((x - D/x) - 1] 

= ((* - l)/x)/(-l/x) = (x- l)/(—1) = 1-x 
= h(x) SO MU) = h also. 

4.10 M x ) — 1 — x. If (x) = y, then y — 1 —x sox = 1 —y and f-f^iy) — 1 — y. 
So / 3 -‘ = m 


Chapter 5 

5.1 {3,4, 5, 6 }. 

5.2 The outcomes are {H, T}. This set has two elements, so it has 2 2 = 4 subsets. 
So there are four events. 

5.3 In the obvious notation, {HO, HI, H2, TO, T 1 , T2}. 

5.4 
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5.6 



(a) (b) 


5.8 \M U E\ = \M\ + \E\ - \M n E\ = 12 + 7-3 = 16, so 16 readers were 
surveyed. 

5.9 The figures are represented by the following diagram. As only readers were 
surveyed, there is no need for any “outside area” outside M U E. 



M E 


5.11 



20 like horror and police procedural movies, but do not like romances. 20 like 
romances only. 10 like none of these three types. 

5.12 Use the function /(1) = a , /(2) = b , and so on: f(n) equals the nth letter. 

5.13 g is one-to-one: if x\ and X 2 are integers and g(x i) = g(x 2 ), then 2x\ = 2 x 2 
so xi = X 2 . g is onto: if y is any even integer, then \y e Zand g(^y) = y. 

5.14 The function /, defined by f(x) = 2x + 1, is a one-to-one correspondence 
between the countably infinite set Z and O. 

5.15 6 ! = 6-5-4-3-2-1 = 720. 

( 6)4 = 6-5-4-3 = 360. Or using the formula: ( 6)4 = 6!/(6 — 4)! = 6!/2! = 
720/2 = 360. 

5.16 Simply n\. 

5.17 Treating the committee as an ordered set of three elements chosen from the 
12-element set of members, the answer is ( 12 ) 3 , or 1320. 
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5.18 The boys can be ordered in 3! = 6 ways, and the girls can be ordered in 4! = 24 
ways. As the table is circular, it does not matter whether the boys are to the left 
or to the right of the girls. So there are 3! x 4! = 6 x 24 = 144 arrangements. 

5.20 There are ( 10)3 boys’ committees and (13)3 girls’ committees, for a total of 
(10 ) 3 x (13)3 = 720 x 1716 = 1235520 choices. 

5.22 There are three A’s, two A’s, and one A, for a total of six letters. So the number 
of orderings is 6!/(3!-2!) = 60. 

5.23 If unlimited numbers of each color were available, there would be 3 4 = 81 so¬ 
lutions. It is necessary to exclude the solution with four blue marbles, BBBB. 
So the answer is 81 — 1 = 80. 

5.24 C(9,5) = |f^f = 126.0 = SBT = 1- 

5.25 She must choose four of the last seven questions, so Qj = 35 ways. 

5.26 There are C( 8 , 5) choices of where to place the l’s, so the answer is C( 8 , 4), 
or 70. 

5.27 You can choose the mysteries in ( 3 ) ways and the westerns in Q ways. So you 
can choose in ( 3 ) x Q = 10 x 35 = 350 ways. 

5.28 The three consonants can be chosen in ( 3 ) = 10 ways, and the vowels in (^) = 
3 ways. After the choice is made, the letters can be arranged in 5! = 120 ways. 
So there are 10-3-120 = 3600 “words.” 

5.29 The committee contains one man or no men. With one man, the number of 
choices is 3 x Q = 63 (3 ways to choose the man, Q to choose the women). 
With no men, there are Q = 35 possibilities. So there are 63 + 35 = 98 
possibilities. 

5.31 Q = = 3240. 

5.32 First, place one coin in each box. Then distribute the eight remaining coins 
between four boxes. This can be done in ( 3 ) = 56 ways. 

5.33 Assuming there are at least 14 books of each color, ( 14 ^ 5 1 _1 ) = (^) = 3060. 
5.35 There are ( 13 5 ^ 5 1 _1 ) = (^) = 2520 solutions. 

5.37 (x - y) 6 = 0/ - (+v 5 + (+V - (+V + 0* V - (+ 5 y+(+ 6 = 

y 6 - 6xy 5 + I5x 2 y 4 - 2lx 3 y 3 + I5x 4 y 2 - 6x 5 y + x 6 . 

5.38 We write 0.99 8 = (1 — 10~ 2 ) 8 . Then it equals 

1 -8-10 _2 + 28-10 _4 -56-10" 6 + ---. 

Every subsequent term is at most one-tenth of the one before it. So the approx¬ 
imate value is 

1 - 0.08 + 0.0028 - 0.000056 + • • • = 0.923 approx. 
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5.39 There are 2 10 possible ways to choose a subset of the ten books. However, the 
subsets with 8 , 9, or 10 elements are not allowed. So the number is 



5.41 In the notation of the Sample Problem, we want to find | 56 U S 7 U 5s U S 9 1. For 
any x, 15*1 = 9 9 , if x ± y, \S X D S y = 8 9 |. Similarly | S x D S y D S z \ = 7 9 , and 
so on. So, from (5.12) the sum is 



and the answer we require is 


4-9 9 — 6 - 8 9 + 4-7 9 — 6 9 . 


5.43 Again let P[ be the set of permutations with i fixed points. No permutation can 
have exactly three fixed points (if three numbers are in their natural position, 
then the fourth must also be). So the answer is \P 4 \-\-\P 2 \ = /(4, 4) + /(4, 2 ) 


(4)A)(4)£>2=l+6-l=7. 


5.44 Partition the set of people into seven subsets, the first subset being those whose 
birthdays fall on a Monday this year, the second those whose birthdays fall on 
a Tuesday, and so on. Since there are more people in our set than there are days 
in the week, at least one subset contains at least two people; that is, at least two 
people have birthdays on the same day of the week. 


Chapter 6 

6.1 The outcomes are equally likely, with probability Events E and G con¬ 
tain two outcomes each ({4, 5} and {3, 5}, respectively) and F contains three 
({2,4, 6 }) so P(E) = l P(F ) = i, P(E) = 

6.2 Write AT to mean there is an X on the quarter and a T on the nickel (where 
X and Y might stand for H or for T). There are four equally likely outcomes, 
namely HH, HT , TH , TT, and two of them ( HT , TH) are in the event. So 
the probability is P(E) = | = 

6.3 There are 52 cards in a deck of which 12 are picture cards. So 



6.4 Define the outcome vy to mean “there is an x on the first die and a y on 
the second.” There are nine equally likely outcomes: each of x and y can be 
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1, 2, or 3. If the event Ej means “the sum is j,” then there are five events 


(£+ £3, £4, £5, Eft) and 


Ei 

= {11}, 

P(E 2 ) = 

Ei 

= {12,21}, 

P(Ei) = 

Ea 

= {13,22,31}, 

P(E 4 ) = 

e 5 

= {23, 32}, 

P(E S ) = 

e 6 

= {33}, 

P(E 6 ) = 


1 

9’ 

2 

9’ 

3 

9’ 

2 

9’ 

1 

9’ 


6.5 There are five non-blue marbles out of nine, so 


P(E) = — 

|S| 


5 

9' 


6.6 


P(E U F) = P(E ) + P(F) - P(E H F) 
= 0.7+ 0.3+ 0.1 =0.9, 
P(E) = 1 - P(E) 

= 1 -0.7 = 0.3. 


6.7 The data can be represented as 



M 

F 

s 

0.19 

0.22 

u 

0.28 

0.31 


The required probabilities are (i) P(S ); (ii) P(F ); (iii) PCS' U £). From formu¬ 
las, P(S) = P(SnM) + P(SnF) = 0.19 + 0.22 = 0.41; P(F) = P(+n,S) + 
P(FfMJ) = 0.22 + 0.31 = 0.53; and P(SUF) = P(5) + P(F)-P(5nF) = 
0.41 + 0.53 — 0.22 = 0.72. This can also be answered by adding the entries in 
the appropriate cells. 

6.8 The possible outcomes of the first subexperiment—the selection of die—are A 
and B , with probabilities each If A is selected, the probability of any of the 
rolls 1,2, 3,4, 5, 6 is g. If 5 is selected, rolls 1 and 6 each have probability ^. So 
there are 8 outcomes—Al, A2, A3, A4, A5, A6, B 1, £6—and the probability 
distribution is 


P(A1) = ^ ~6 


1 

12 9 
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So 


P(A2) = - x - = —, 
2 6 12 

1 1 1 

P(A3) = - x - = —, 
2 6 12 

1 1 1 

7 2 6 12 

1 1 1 

V ' 2 6 12 

1 1 1 

V ' 2 6 12 

1 1 1 

P(B 1) = - x - = 

2 2 4 

1 1 1 

P(B6) — - x - — 


P( 6 ) = P(A6) + P(B6) = 1 + 1 = 1 
1 


P( 3) = P(A3) = —. 


6.9 There are three ways in which exactly one success can occur: the sequences 
SFF, FSF , and FFS. As 

12 2 4 

P(SFF) = — x — x — = —, 

3 3 3 27 

2 12 4 

P(FSF) = — x — x — = —, 

3 3 3 27 

2 2 14 

P(FFS) = — x — x — = —, 
v 3 3 3 27 

the probability of exactly one success is 

4 4 4 _ 12 

27 + 27 + 27 “ 27' 


6.10 In this case each call is a Bernoulli trial with p = so the five calls in a day 
can be thought of as a binomial experiment with p = ^,n 


P (3 successes) = 
P(4 successes) = 
P(5 successes) = 



1 


5. So 
10 


= 10 x — = 

32 32 

, 1 5 

32 32 

1 1 

= 1 x — = —, 
32 32 
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and the probability of at least two successes is the sum of these: 

10 + 5 + 1 _ 16 _ 1 
32 “ 32 “ 2' 


6.11 See the Sample Problem. Continuing with our experimental arithmetic 


n 3»+ 1 (3 + n) 
n — 9 : - 


6561 x 12 
262, 144 


= 0.30..., 


which is just too big. Clearly the next term will be smaller than 0.3 (if you 
check, you will find that the n = 10 term is 0.24...). So he needs to make 10 
calls. 

6.12 There are 11 marbles, so there are (3 1 ) ways of selecting three. The number 
of selections with one of each color is 5 x 4 x 2 = 40. So the probability is 
40/(3*) = 8 / 33 - 

6.13 There are 10 balls, so two can be chosen in C(10, 2) =45 ways. Two blue balls 
can be selected in C(4, 2) = 6 ways. So the probability of no white ball—two 
blue—is 6/45 = 2/15, and the probability of at least one white is 1 — ^ = ||. 

6.14 The number of spade flushes is ( 1 5 3 ), and similarly for the other suits. So the 
total number of flushes is 4 x ( 1 5 3 ). As noted in the Sample Problem, the total 

5 J. So the probability of a flush is 


4 X (* 5 3 ) _ 4 - 13 - 12- 11 • 10-9 _ 33 

( 5 /\ ~~ 52-51 -50-49-48 ~~ 16,660’ 


or about one chance in 500. 

/ 18 \ 

6.15 There are ( 3 j possible committees. The number of committees with all math 
majors is ( 3 ); for economics majors it is ( 3 ); and for computer science majors, 
(3) . So the probability is 


( 3 ) + ( 3 ) + (D = 

(* 3 8 ) 816' 

6.16 There are 4! = 24 ways the students can choose backpacks, so the probability 
that they all choose correctly is ^. As we saw, the probability that none choose 
correctly is So the required probability is 

1 3_24 — 9—1_14_ 7 

~~ 24 ~~ 8 “ 24 “ 24 “ 12’ 

6.17 After the first experiment there are 51 cards left: 12 spades and 13 hearts. So 
the probability of a spade as the second card is ^; of a heart, ^. 
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6.18 We write S for “a spade” and N “a card other than a spade ” The probabilities 
are 


12 

/>(* i = Tl , 

13 

P(S I N) = —, 

and, by subtraction, 

39 

P(N 1 5) = 5? 

38 

P(N | N) = —. 

So the diagram is 

2 _ 12 = 12 _ 

4 51 204 

1 39 _ 29 

4 51 204 

3 13 _ 39 

4 51 204 

3 38 _ 114 

4 51 204 


12 ^OS P(S,S ) = 
P(S,N) = 

. ._ Tl^° S P(N ^ = 

4 A/^38 

5!^ON P(N,N) = 


6.19 Write D for the event that the committee has both Democrats, R for both Re¬ 
publicans, and M for a mixed committee. There are (®) = 28 possible commit- 
tees, so |S| = 28, while \D\ = 3, \R\ = 10, and \M\ = 15. So P(D) = 
There are 15 committees that contain at least one Democrat, so the probability 
that both are Democrats, given that one is, is ^ or 

6.20 P(D n R) = P(R I D)P(D) = 

6.21 From the data, 

3 

v ) g , 

1 

V ) g, 

4 

no = 

P(A n B) = 0, 
p(A nc) = ^, 
p(Bnc) = o. 


No two are independent. 
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6.22 In Sample Problem 6.20 we saw that P(F) = 0.6 and P(F \ Y) = 0.02. We 
also knew that P(X ) = 0.4, and we have just been told P(F \ X ) = 0.015. So 

P(F) = P(x n F) + P(Y n F) 

= P(X)P(F I X) + P(Y)P(F I Y) 

= 0.4 x 0.015 + 0.6 x 0.02 = 0.006 + 0.012 = 0.018. 

6.23 Use B, F, H,T for biased, fair, heads, tails. Then 

P(H ) = P(H | F)P(F) + P(H | B)P(B) 



2 

P(B nH) = P(H I B)P(B) = -. 

So P(B | H) = |/| = l 

6.24 We know that P(X) = P(Y) = 0.5. From Bayes’ formula, 

P(B)P(X | B) = P(X)P(B | X), 
0.5 x P(X | B) = 0.5 x 0.4, 

P(X | 5) = 0.4. 


6.25 



R 

W 

R 

W 


Ax .5 = .02 = P(R I F) P(F) 


.6x3 = .02 = P(R I P) P(P) 


From the Figure, the denominator is 0.04. So 


P(F | R) = 


P(R | F)P(F) 
004 


0.02 

004 


= 0.5. 


6.26 We want P(D \ V). From the tree diagram in the Sample Problem, 

0.112 

P(D|y)= 0^53 = °- 17 - 
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6.27 



H 

T 


B 

2 

9 

l 

9 

1 

3 

U 

1 

3 

1 

3 

2 

3 


5 

4 



9 

9 



To find P(B | H ), look at column H, which has a total of |. Then look at the 
(B, H) entry and take the ratio 


/>(£ | H) = 


P(B n H) 
P(H ) 


2 

_9 

5 

9 


2 

5' 


6.28 We use the abbreviations T (tests positive), N (tests negative), D (has the dis¬ 
ease), and H (is healthy). We want to find P(D \ T). From the data we know 
the following: 


P(T | D) = 0.95, 
P(T | H) = 0.05, 
P(D) = 0.05, 

So 


P(T | H)P(H) + P(T | D)P(D) 

(0.95)(0.05) 

““ (0.05)(0.95) + (0.95)(0.05) 

0.0475 _ 0.0475 _ 

“ 0.0475 + 0.0475 ~ 0.095 “ °’ ' 

This example shows that, in some cases, the test gives no useful information. 


P(N | D ) = 0.05, 
P(N | H) = 0.95, 
P(H) = 0.95. 


P(T | D)P(D) 


Chapter 7 


7.1 The diagrams are 



Relation ~ yields a graph, 


1 O—<—0 2 1 

/\ 

4 O—>—0 3 4 

(ii) - 


gives a looped graph, 



, and ~ a simple digraph. 
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7.2 


2 3 1 3 2 3 3 1 



22322 333 1 


7.3 5, 3, 3, 1, 1, 1 is impossible. Suppose there were such a graph. Write x for the 
vertex of degree 5 and y for the first vertex of degree 3. If x ~ y, there are 
7 edges with one endpoint in {x, y} and the other outside. So the sum of the 
degrees of the other vertices must be at least 7. But it is only 6—too small. If x 
and y are not adjacent, the deficiency is even greater. The other two are graph¬ 
ical. Here are realizations 


1 2 2 2 1 



7.4 


o= 

D 



The walk must start (or finish) at T because there is 
only one bridge to that island. With a little experi¬ 
mentation we find the solution TXYZXYX. 



7.6 The original network is shown in Figure (i). We start from A, and randomly 
choose the walk AGJKHDA. After these edges are deleted, Figure (ii) re¬ 
mains. 


We now start at D (alphabetically, the first vertex remaining that was in the 
first walk and is not yet isolated). One walk is DBCFEHGD, and its deletion 
leaves Figure (iii). 

Finally, walk FILKIHF uses up the remaining edges. 



Putting these together, we get AGJKHDBCFILKIHFEHGDA. 
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7.7 The original is shown on the left. The two black vertices need another edge. 
They are not joined, so one edge will not suffice. So eu(H ) > 1. The right- 
hand picture shows a Eulerization that requires just two edges. So eu(H ) = 2, 
and the right-hand picture is a good Eulerization. 




7.8 In a city’s road network, a cutpoint represents an intersection such that all traffic 
between two areas has to pass through that intersection. As such, it might get 
a traffic officer assigned in peak hours, or special priority might be given to 
servicing its traffic lights. 

7.9 C has no cutpoint or bridge, so k(C) > 2 and k'(C) > 2. However, it may be 
disconnected by removing either the two edges incident with the top left vertex, 
or the other endpoints of those two edges. So k(C) = k'(C ) = 2. 

D has a cutpoint, the central point, so tc(D) = 1. There is no bridge, but there 
are several ways to disconnect D by deleting two edges, so k'(C) = 2. 

7.10 C has minimum degree 2, so /c(C), tc'(C), 8(C) = 2, 2, 2. D also has minimum 
degree 2, so k(D ), k'(D ), 8(D) = 1, 2, 2. 

7.11 There are four different-looking vertices, shown in black. The distance of every 
other vertex from the black one is shown in each case. (The top left and second 
from left vertices are actually equivalent, but this is not easy to see.) 



1 2—2 2 1 2—2 2 2 1 — 2^2 2 ^ 2 — 1^1 


7.12 The graph (b) of Sample Problem 7.11 has D = R = 2. The center of W n has 
eccentricity 1, while every other vertex has eccentricity 2, so D = 2, R = 1. 
In C 2 rc+i, every vertex is distance n from the two opposite vertices, and the 
distance is smaller for all other vertices, so D = R = n. 

7.15 There are four types of vertex in the first tree and two in the second. The trees 
are shown as they were in the Sample Exercise. 
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3 

4 
4 
3 




7.16 If a tree has four vertices, then the largest possible degree is 3. Moreover there 
are three edges (by Theorem 48), so from Theorem 43 the sum of the four 
degrees is 6 . As there are no vertices of degree 0 and at least two vertices of 
degree 2, the list of degrees must be one of 3, 1, 1, 1 or 2, 2, 1, 1. In the first 
case, the only solution is the star ^ 1 , 3 . The only case with the second degree 
list is the path P 4 . So there are two trees. 

7.17 Select vo = a. Then Si = {a}. Now yo = d as w(a, d ) < w(a, b), so x\ = d 
and S 2 = {a, d}. The tree will contain edge ad. 

Working from S 2 , we get yo = b and y\ = e. Since ab (= voyo) has a smaller 
weight than de (= x\y\), we select X 2 = b. Then S 3 = {a, d, b} and edge ab 
goes into the tree. Similarly, from S 3 , we get V 3 = y 2 = e and S 4 = {a, d, b, e), 
and the new edge is be. Working from S 4 , X 4 = y 2 = c and we add edge be; 
S 4 = {a, d, b, e, c}. The final vertex is /, and the edge is cf. So the tree has 
edges ad, ab, be, be, cf and weight 14. 

7.18 Candidates (i) and (iv) are Hamiltonian. Candidate (ii) is not Hamiltonian be¬ 
cause it contains a repeated vertex, b. Candidate (iii) is not Hamiltonian because 
the graph contains no edge cd. 

7.19 To traverse vertex a, a Hamiltonian cycle in G must contain one of the paths 
bad, bae, or ead. If bad is included, the other edge through d might be dc 
or de; in the former case, neither be nor de can be edges, and the only cycle 
is badefe, while the latter case bars be, and the only cycle is badefc. If bae 
is included, ad is not an edge, so cd and de are edges, so we have the path 
baedc, and the cycle is baedcf. If ead is included, de is not an edge, so dc is 
an edge, and there are two possibilities, adcbfe and adefbe. So there are five 
Hamiltonian cycles. 

Any Hamiltonian cycle in H must contain edges ab and ad because a has 
degree 2. This means bd is not an edge (it would form a triangle), so de must 
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be in the cycle. There are two ways to finish a cycle: bcfe or bfce. So there 
are two cycles: dabcfe and dabfce. 

7.20 The nearest-neighbor algorithm, starting from Evansville, begins with EM be¬ 
cause it has the least cost of the three edges incident with E. The next edge 
must have M as an endpoint, and ME is not allowed (one cannot return to 
E , it has already been used), so the cheaper of the remaining edges is chosen, 
namely MN. The only available edge from N is NS, as E and M have already 
been visited, and the route is EMNSE , with cost $430. 

Starting at Nashville, the first edge selected is NE , with cost $90. The next 
choice is EM , then MS, then SN, and the resulting cycle NEMSN costs 
$410. If you start at St. Louis, the first stop will be Memphis ($110 is the 
cheapest flight from St. Louis), then Evansville, then Nashville, costing $410. 
Lrom Memphis, the cheapest leg is to Evansville, then Nashville, and finally 
St. Louis, for $410. So both St. Louis and Memphis yield the same cycle as the 
Nashville case (with different starting points, and in reverse). 

To apply the sorted edges algorithm, first sort the edges in order of increas¬ 
ing cost: £M($80), EN($90), NM($100), M5($110), ES ($120), NS($ 130). 
Edge EM is included, and so is EN. The next choice would be MiV, but this 
is not allowed because its inclusion would complete a cycle of length 3 (too 
short), so the only other choices are MS and NS, forming route EMSNE (or 
ENSME) at a cost of $410. 

In this example the route ENMSE, with cost $420, does not arise from either 
algorithm. 


Chapter 8 

8.14(2,0,-1) = (8,0,-4), so 4(2, 0,-1) + (1,4,-3) = (8, 0,-4) + 
(1,4, -3) = (9,4, -7). 

8.2 u t = (-1, 3, 0) • (1, 2, 3) = (-1 • 1) + (3 • 2) + (0 • 3) = (-1) +6 + 0 = 5; 
(2u — 3v)d = [(-2, 6, 0) + (—6, 6, —6)] -(l, 2, 3) = (-8, 12, —6)-(l, 2, 3) = 
(-8 • 1) + (24 • 2) + (-6 • 3) = -8 + 12 - 18 = -2. 

8.3 Lrom (2x, 2) = (y, x) we get the two equations 2x = y and 2 = x. The second 
gives x = 2, so the first gives y = 2-2 = 4. 

8.4 —A = “j 1 J ,3 A — B = ^ j, ^ + C is not defined, as B and C are of 

different sizes. 



5 3 

-3 -1 


8.7 CD = DC = 
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8.8 Suppose C has inverse 



Then CE = /, so 


"1 

0" 


y" 


~X y" 


"1 

0" 

_0 

0_ 

_z 

t 


_0 0_ 


_0 

1 


The (2, 2) entries of the two matrices cannot be equal (0 / 1), so no such E 
exists. Similarly, if D has inverse 


x y 
z t 


then 


"0 r 

"x y" 


z t 


"1 

0" 

0 0 

_z t _ 


0 0 


0 

1 


and again we have the impossible equation 0=1. 
8.9 Suppose the inverse is 



Then BC = / means 


"2 

r 

X 

z 


"1 

0" 

1 

i 

_;y 

t 


0 

1 


which is equivalent to the four equations 

2x T y = 1, 2 z + t — 0, 

x + y = 0, z + t = 1. 

The left-hand pair of equations are easily solved to give x = l and y = — 1, 
while the right-hand pair give z — —l and t — 2. So the inverse exists, and is 



8.12 For A: R3 R3 - R 1 ; #2 ** 7?3; 7?1 7?1 - 27?2, R3 ±- 

Rl <r- R1 + #3. For 7*1 R3 ; 7*2 ^ 7*2 - 27*1, R3 <- 

7*3 ^ 7*3 - 2R2. 


8.13 


det(C) = 1- 4—1-3=1; C _1 = 

det(D) = 2- 2 — 4-1=0; no inverse. 


7*3 + 27*2; 
7*3 - 37? 1; 
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8.14 As observed in the Sample Problem, the matrix of coefficients has inverse 

"11 0 " 

10-1. 

_2 1 — 2 _ 

Now 


1 1 0 


"2" 


"4" 

1 0 -1 


2 

= 

1 

1 

<N 

1 

<N 

_1 


1 


4 


so the first system has solution x = 4, y = 1, z = 4. 



1 1 0 

4" 


3 


1 0 -1 

-1 

= 

1 


<N 

1 

<N 

3 _ 


1 

and the second system has solution v 

= 3, y 

= 1 

, z = 


8.16 From the definition of r, the matrix of /3 is 


M T 


10 10 
0 10 0 
10 0 1 


so 


M a Mr 


2 0 11 
110 1 


and M a v Mp = 


1 0 1 
1 1 0 


1 

1 


8.17 We take the vertices of K\ to be {1, 2, 3, 4}. Taking the vertices in numerical 
order, the adjacency matrix is 


0 111 

10 11 
110 1 ' 

1 i i 0_ 

If we take the edges in order (1,2), (1,3), (1,4), (2, 3), (2, 4), (3, 4), the inci¬ 
dence matrix is 

" 111000 " 

10 0 110 

0 10 10 1 ' 

0 0 10 11 
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Chapter 9 

9.1 36 = 2 2 • 3 2 ; 132 = 2 2 • 3 • 11; 47 is itself prime; 55 = 5 • 11. 

9.2 246 = 2 • 3 • 41, so the factors are 1, 2, 3, 6, 41, 82, 123, and 246. 


584 = 2 • 284 + 16, 
284 = 17 • 16+12, 
16 = 1 • 12 + 4, 

12 = 3 - 4. 


So (584, 284) = 4. 

9.4 

156 = 1 . 144 + 12, 162 = 13-12 + 6, 

144 =12-12, 12 = 2 - 6. 

So the GCD is 6. 

9.5 Reverse the solution to 9 . 3 . 

4 = 1 - 16- 12 

= 1 - 16 - (284 - 17 • 16) = 18 • 16 - 284 
= 18 • (584 - 2 - 284) - 284 = 18 • 584 - 37 • 284. 


9.6 Modulo 7, 0 2 = 0, l 2 = 1, 2 2 = 4, 3 2 = 9 = 2, 4 2 = 16 = 2, 5 2 = 25 = 4, 
and 6 2 = 36 = 1. So x = 3 or 4. 

9.7 Modulo 7, 5 2 = 4, so 5 2 + 2 = 6. Modulo 12, 5 2 = 4, so 5 2 + 2 = 6. Modulo 
12, 14 = 2, so 4- 14 = 8, and 4- 14- 9 = -1 = 11. 

9.8 l -1 = 1 and 5 -1 = 5. 2 • 3 = 6 = 0, and 4 • 3 = 12 = 0, so 2, 3 and 4 are 
zero-divisors. 

9.9 Clearly 11 divides 132, 55, and 66, so 55 and 66 are zero-divisors. 


132 = 2 

49 + 34, 

132 

= 1 -91+41, 



49 = 1 

34+15, 

91 

= 2-41 + 9, 



34 = 2 

15 + 4, 

41 

= 4-9 + 5, 



15 = 3 

4 + 3, 

9 

= 5 + 4, 



4 = 1 

3 + 1, 

5 

= 4+1. 



4 - (15 - 

3 • 4) = 4 ■ 

(34- 

2 - 15) - 15 = 4 

•34-9 

• 15 

(49 - 34) 

= 13 • 34 - 

9 • 49 

= 13 - (132 - 2 - 

49) - 9 

•49 


= 13 • 132 - 35 - 49) so 49“ x = -35 = 97. 

1 = 5-4 = 5 —(9—5) = 2-(41 —4-9) —9 = 2-41—9-9= 2-41—9-(91—2-41) 
= 20 • 41 - 9 • 91 = 20 • (132 - 1 - 91) - 9 • 91 = 20 • 132 - 29 • 91 so 
91 _1 = -29 = 103. 
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9.10 R = 385, Ri = 77, R 2 = 55, R 3 = 35. So llx = 1 (mod 5), or 2* = 1 
(mod 5), has the solution x = 3 (mod 5), 55x = 1 (mod 7), or 6 x = 1 
(mod 7), has the solution x = 6 (mod7), 35x = 1 (mod 11), or 2x = 1 
(mod 11), has the solutions = 6 (mod 11). So A = 77-3-3+55-6-2+35-6-4 = 
2193 = 268 (mod 385). 

9.11 The first word starts with T. A key of 3 makes it start TTA..., which is not 
promising. Key 4 makes it TNH. .., not good. Key 5 makes it THE. .., so we 
try, getting 

THEBAR 
BARIAN 
SAREAT 
THEFRO 
NTGATE 

the barbarians are at the front gate. 

9.12 HQWHU WKH PDCH. 

9.13 Looking at the second word: 

CDEFGHIJKLMNOPQRSTUVWXYZAB 
PQRSTUVWXYZABCDEFGHIJKLMNO 
FGHIJKLMNOPQRSTUVWXYZABCDE, 

the only possibility is AND so the cipher has key 2. The message was here and 
now. 

9.14 BIRTHDAY is good. APPLE has repetitions, TEA is too short, BEAD is too 
close to the beginning of the alphabet. 

9.15 112160801020520. (Notice that the first A is coded as 1, while the second is 
01; there is no need for a leading zero in the first letter.) 

9.16 3 2 = 9, 3 4 = 81, 3 8 = 6561, so 3 13 = 3 x 81 x 6561 = 1594323. 3 4 = 5 
(mod 19), 3 8 = 25 = 6, so 3 13 = 3 x 5 x 6 = 90 = 14. 

9.17 Modulo 53, 7 2 = 49 = -4, 7 4 = (-4) 2 = 16, 7 8 = 16 2 = 256 = 44 = -9, 
7 16 = (—9) 2 = 81 = 28, so 7 29 = 7 16 • 7 8 • 7 4 • 7 = 28 • 44 • 16 • 7 == 25. 

9.18 This time we have 

A 3 = 6545 x 39 x 21 + 3910 x 9 x 35 + 3542 x 3 x 3 
= 6623883 = 343 (mod 301070). 

Therefore A 3 = 343, and A = 7. 
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9.20 Calculating modulo 48, 

5 2 = 25, 

5 3 = 125 = 29, 

5 4 = 145 = 1, 


so the exponent is 4. 

9.22 The square root of 449329 is 670.3_This is not an integer, so we start testing 

from x = 671. 


x = 671, 

X 2 -n = 912, 

Vx 2 — n = 

x = 672, 

x 2 — n = 2255, 

Vx 2 -n 

x = 673, 

x 2 — n = 3600, 

\! x 2 — n 


30.1... 
= 47.4... 
= 60. 

So we take y = 60. The factorization is 449329 = 613 x 733. 


Chapter 10 


10.1 A receives 12 votes, B receives 11, C receives four. So (i) there is no majority 
winner (as there are 27 voters, 14 votes would be needed), and (ii) A is the 
plurality winner. In a primary election, A and B are selected to contest the 
runoff. For the runoff, C is deleted, so the preference profile is 


7 

5 

8 

3 

4 

A 

A 

B 

B 

B 

B 

B 

A 

A 

A 


or (combining columns with the same preference list) 


12 

15 

A 

B 

B 

A 


So B wins the runoff. 

10.2 (i) The votes for A, B , C, and D are 13, 12, 7, and 11, respectively, so A would 
win under plurality voting. 

(ii) Under the runoff method A and B are retained, and the new preference 
profile is 


6 

1 

1 

1 

2 

7 

5 

2 

A 

B 

A 

A 

A 

B 

B 

B 

B 

A 

B 

B 

B 

A 

A 

A 
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giving 22 votes to A and 21 votes to B, so A wins again, 
(iii) In the Hare method we first eliminate C, obtaining 


67772752 

ABADDDBD 

DDDAABDB 

BABBBAAA 


Now we eliminate B 


67772752 

ADADDDDD 

DADAAAAA 


So D wins 30-13. 

10.5 There are 32 voters, so the quota is 8. A gets 10 first place votes, so A is elected. 

A has a surplus of 2. The votes are divided in proportion 5 : 5 between A’s two 
lists, so we give one vote to each. After this allocation, the new table is 


1 1 7 8 5 2 
B E B C D E 
E B D D C B 
D D E E E C 
C C C B B D 


B has eight votes, C has eight, D has five, and E has three. No one meets the 
quota. So E (who had the fewest votes) is eliminated. We now have 


1 

1 

1 

8 

5 

2 

B 

B 

B 

C 

D 

B 

D 

D 

D 

D 

C 

C 

C 

C 

C 

B 

B 

D 

B has 11 votes and is elected. The surplus of three votes is distributed in ratio 

1 : 1 : 7 : 2, approximately 0.27 : 

0.27 : 

1.91 

: 0.55. The table is 

0.27 

0.27 

1.91 

8 

5 

0.55 

D 

D 

D 

C 

D 

C 

C 

C 

C 

D 

C 

D 


C now has 8.55 votes and is elected. In total, A, B, and C are elected. 
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10.8 Originally, S is eliminated first, giving 


8 

6 

5 

P 

Q 

R 

Q 

R 

P 

R 

P 

Q 


Next is R , leaving 

8 6 5“ 

P Q P 

Q P Q 

and P wins. 

If one voter switches, the profile becomes 


8 5 15 
P Q R R 
Q R Q S 
R P P P 
S S S Q 


Again S is eliminated, leaving 


8 

5 

1 

5 

P 

Q 

R 

R 

Q 

R 

Q 

P 

R 

P 

p 

Q 


But now Q is eliminated, giving 


8 

5 

1 

5 

P 

R 

R 

R 

R 

P 

P 

P 


and R wins. 

In a head-to-head competition between Q and R , Q would win 14-6 in the 
original election, and 13-7 using the modified preferences. 
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Section 1.1 


1 . T 3 . F 5 . F 7 . F 9 . T 11 . January, June, July 13. m, i, s, p 15. red, white 
blue 17 . (i) Q, M (ii) M (iii) Z + , Z, Q, M (iv) M (v) Q, M (vi) M 19 . 1, 2, 3, 4, 6, 9, 
12,18,36 21.1,2,3,4,6,8,9,12,18,24,36,72 23.1,3,7,9,21,63 25.1,29 
27 . 1, 2, 3, 4, 6, 8, 12, 16, 24, 48 29 . 231 = 3 • 7 • 11, 275 = 5 2 • 11, GCD = 11 
31 . 95 = 5-19,125 = 5 3 , GCD = 5 33 . 88 = 2 3 • 11,132 = 2 2 -3-11, GCD = 44 
35 . 1080 = 2 3 • 3 3 • 5, 855 = 5 • 3 2 • 19, GCD = 45 37 . 1 39 . 41 . 

43.^ 45.2 47. A 49.28 51.-83 53.12 55.78 57.11 59.1.73 

2 

Section 1.2 


I. 2 + 5 + 10 + 17 + 26 + 37 = 97 3. v / 2+v / 3 + v / 4 = 2 + v / 2 + v / 3 

5.1+8+27 = 36 7.3+8+15+24 = 50 9. (10+^)+(ll + -J r )+(12+ 1 L) = 33^ 

II. (2+1) + (3 — 1) + (4+1) + (5 — 1) = 14 15. 2A+n 17.3 A-B 19.42 21.124 
23.416 25. \n(2n 2 + 3n + 7) 27. 2 - 2~ 9 29. \(n + \)(n + 2)(2n + 3) - 1 = 
\n(2n 2 + 9 n + 13) 

Section 1.3 


1. 55 3. 1531 5. 3363 7. 255 9. 95 11. 151 13. 136 15. 6.75 17. 1.01 
19.16.857142 21.136.0625 23.20.083 25.46.48 27.0.25 29.21.625 31.14.6 

33. 37.527_35. 131.2 37. 5.6 39. 7.456790123 41. 101011 43. 110111.11 

45. 1110.110 47. 1111110 49. 1101111 51. 1111110.11 53. 11100011011 

55. 4.04 57. 15.42243205 59. 0.2 61. 315 63. 5040 65. 22.24 67. 42.14 
69.44.2 71. 6C 73. B.83E 75. E.01AEU 77. FF 79 . IB 81. 1.1£07C 
83.40.9 85.924 87.5 AD.C 89.5.2 E6DB 91. 76£.4 _93. 1101000000001 
95.1000100000001 97.10101110.1111111 99.111100100.10 


W.D. Wallis, A Beginner's Guide to Discrete Mathematics , 

DOI 10.1007/978-0-8176-8286-6, © Springer Science+Business Media, LLC 2012 
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Section 1.4 


1. (i) 1.05 x 10 4 (ii) -1.10 x 10“ 2 (iii) 1.11 x 10“ 4 (iv) 1.04 x 10 9 (v) 1.50 x 10“ 2 
(vi) 4.44 x 10 4 3 . 1.1740 x 10 2 5 . 4.2857 x 10 _1 7 . 3.8350 x 10 3 9 . 2.2411 x 
10 2 11 . -2.1200 x 10 1 13. 2.0830 x 10 2 15 . 1.0799 x 10 2 17 . -7.3480 x 
10 1 19 . 1.1230 x 10 3 21 . 6.0025 x 10 3 23 . 1.6264 x 10 4 25 . 6.1022 x 10 3 
27 . -1.2960 x 10“ 3 29 . 9.1494 x 10 2 31 . 1.3733 x 10 3 33 . 3.5258 x 10 4 

35 . 1.4363 xl0“ 2 37 . 1.69xl0 4 39.-3.33X10 1 41.7.78x10° 43.-3.55x 10“* 


Section 1.5 

1 . 00101110, 2 E 3 . 01111111, IF 5 . 11111111, FF 7 . 10000010, 82 
9 . 11110010, F2 11 . 11000111, Cl 13. 10000000, 80 15. 11111100, FC 

17. 00010001, 11 19 . 0 21 . -118 23 . 113 25 . 15 27 . -58 29 . -125 

31.13 = 00001101 33.-16 = 11110000 35.8 = 00001000 37.40 = 00101000 
39 . 01000001 1001 11010000000000000000 = 419D0000 41 . 010000100100 
0000 0000 0000 0000 0000 = 42400000 43 . 0100 0001 1101 1000 0000 0000 0000 
0000 = 4 ID 80000 45 . 11000100100000010011 100000000000 = C4813800 
47 . 21 x 2" 118 49 . -47 x 2" 109 51. 13 x 2 10 53 . ±32767, ±2.15 x 10 9 , 

±9.22 x 10 18 55 . No 


Section 2.1 

1 . _ 

P q _ ~p A q 

FT F F 

T F F F 

FT T T 

F F T F 

3 . _ 

p q _ p ->• q p^- (p^-q) 

T T T T 

T F F F 

FT T T 

F F T T 


5 . _ 

p q ~p ~p —> q p A (~/> — »■ q) 

T T F T T 

T F FT T 

FT T T T 

F F T F F 
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P 

q 


P 





T 

T 

F 


F 

F 



T 

F 

F 


T 

T 



F 

T 

T 


F 

T 



F 

F 

T 


T 

T 

9 . 









P 

q 


r 


p /\q 

(p A q) V r 


T 

T 


T 


T 

T 


T 

T 


F 


T 

T 


T 

F 


T 


F 

T 


T 

F 


F 


F 

F 


F 

T 


T 


F 

T 


F 

T 


F 


F 

F 


F 

F 


T 


F 

T 


F 

F 


F 


F 

F 

11. 









P 

q 


r 


qV r 

p V (q V r) 


T 

T 


T 


T 

T 


T 

T 


F 


T 

T 


T 

F 


T 


T 

T 


T 

F 


F 


F 

T 


F 

T 


T 


T 

T 


F 

T 


F 


T 

T 


F 

F 


T 


T 

T 


F 

F 


F 


F 

F 

19. 









P 

q 


r 

q 

r 

P ->• (q ->• r) 


T 

T 


T 

T 


T 


T 

T 


F 

F 


F 


T 

F 


T 

T 


T 


T 

F 


F 

T 


T 


F 

T 


T 

T 


T 


F 

T 


F 

F 


T 


F 

F 


T 

T 


T 


F 

F 


F 

T 


T 


p q _ py_q r Hpj\q) (pvg)A~(/>Ag) 

T T T T F F 

T F TFT T 

FT TFT T 

F F F F T F 


21 . 
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29 . _ 

p g pyy 
T T F 
TFT 
F T T 
F F F 


31 . _ 

p ~p p -> 

T F F 

FT T 

33 . _ 

p q pWq p A q (p A q) ^ (p V q) 

TFT F T 
F T T F T 

F F F F T 


35 . 

P g 
T T 
T F 
F T 
F F 


P q P A g (p -> q) -> (p Aq) 

T T T 

F F T 

T F F 

T F F 


~P iiP ^ q) ^ (p A q)) V (~p) 
F T 
F T 
T T 
T T 


37 ._ 

p g p q q (p q) 

T T T T 

T F F T 

F T T T 

F F T T 


39 ._ 

p q ~p q -> p ~p -> (q -> p) 

FT T 

FT T 

T F F 

T T T 


T T 
T F 
F T 
F F 


Section 2.2 


1. (i) a , b, c, d , e, g, i (ii) c, e (iii) b, d (iv) a, b, c, d, e, g 3. (i) 1, 2, 3, 4, 5, 6, 7 , 
8, 9 (ii) 1, 2, 4, 5, 6, 7, 9 (iii) 1, 5 (iv) 1, 5 5. Si c ft, ft c ft, ft c ft, ftl c ft, 
ft c ft, ft c ft, ft c ft, ft = ft 7. (i) No; set of all squares of multiples of 5 
(ii) No; S' 31. T 33. F 35. F 37. T 39. T 41. F 
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Section 2.3 



29. (i), (iv), (v) can be concluded. 
Section 2.4 


5. F 7. T 9. F 11. T 13. T 15. (i) (1, 1), (1, 4), (1, 5), (2, 1), (2, 4), (2, 5), 
(3, 1), (3,4), (3, 5) (ii) (1, 2), (1, -2), (-1, 2), (-1, -2) (iii) (1, 1), (1, 2), (1, 3), 
(3, 1), (3, 2), (3, 3), (5, 1), (5, 2), (5, 3), (7,1), (7, 2), (7, 3) 17. (i) S x T = 0 
(ii) 5 = 0 or 7 = 0 or S = T 


Section 2.5 


7.1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144 
Section 3.1 

5. (x + y)z' 7 . x' + y + z 9 . x + yz' + y 11 . (x + y)(y + z)(z + x) 19 . (15) 
x(x' + 5 ) = xy (16) x(x + y)y = xy + yO (17) xy' + z' = (x'z + yz)' (18) 
x(y + xz) = xy + xz 25. (0, 0), (1, 1) 
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Section 3.2 


1. (i) 0 (ii) 0 (iii) 0 (iv) 1 3. (i) 1 (ii) 1 (iii) 1 (iv) 0 5. e l.f 9 . d 11. e 13. / 
15.1 17.1 19./ 21.0 23 .xy'z 25.x/ 27. xz +x'y+ yz 29 . xy + xz + yz 

31. xy r + x'y 33. x + y 35. y + z 37. xz! 39. Yes 41. Yes 43. xyz! + xy'z + 
xy'z' + x'yz' 45. xyz + xyz' + xy'z + x'yz + x'y'z + x'y'z' 


Section 3.3 


1.x 3.x + / 5. y' 7. (i)xz'(ii)xz 9.xy + xz 11.xy' + yz' + zx' 13. xy + x'z 
15. xy + yz' 17. x+y'z 19. xz'+x'z + y'z 21. (i) xy'z' (ii) ? 23. xyt+x'zt + yz?' 
25. y'z'+ x'y'+ y'?' 21. xyz + xy'z'+x'yz'+x'y'zt' + yzt 29.x' 31. (i )xyz'? + 
xy'z?' + xy'z't' + x'y'z? + x'y'zt' + x'yz?, xyz't + xy f t f + x'zt + y'zt' (ii) xyz't' + 
xyz't + xy'zt + xy'zt' + xy'z't' + xy'z't + x'y'zt' + x'y'z'?' + Jc'y'z'r + x'yz't ', 
xy' + xz! + y'z' + y't' + z't' 33. (i) xyz'?' + xyz't + xy'z? + xy'zt' + x'y'z? + 
x'y'z?' + x'yz'?', xyz' + yz'?' + y'z (ii) xyz? + xyz?' + xy'z? + xy'z?' + xy'z't' + 
xy'z't + x'y'z't' + x'y'z't + x'yz? + x'yz?', xy' + yz + y'z' 35. xy + x'y' + yz' 
37. x'y'z? 39. x'yz + x'zt + x'y? 


Section 3.4 


3 . 





17 . xy + [(x + y)' + (xy')']; x' + y 
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Section 4.1 

1 . {(1, 1), (4,2), (9,3)} x a -1 y means x 2 = >> 3 . {(1, 1), (1,2), (1,3), (1,4), 

(1, 5), (1, 6), (1,7), (1, 8), (1, 9), (2, 2), (2,4), (2, 6), (2, 8), (3, 3), (3, 6), (3, 9), 
(4,4), (4, 8), (5, 5), (6, 6), (7,7), (8, 8), (9, 9)} xy~ l y means y divides x 5. {(2, 8), 
(3, 3)} x e -1 y means x + y 2 = 12 7 . a : {(1,1)} j6 : {(1, 2), (2, 3), (3,4)} 

9 . {(1, 2), (2, 2), (2,4), (3, 2), (4, 1), (4, 3)} 11 . {(2, 2), (3, 2), (3,4), (4, 1), (4, 2), 
(4,3)} 25 . (i) a (ii) y (iii) a, ft (iv) a (v) - (vi) a 27 . S 29 . S 31 . ST 33 . RAT 

Section 4.2 


1 . No (try a = 1 , b = 0 , c = — 1 ) 3 . Yes 5 . Yes; weak; not total 7 . Yes; strong; 
total 17 . 7; 4 

Section 4.3 

1 . No 3 . Yes 5 . Yes 7 . Yes 9 . {(( x, y), x + y) : x, y e Z} (a set of ordered pairs 
in which the first element, (x, y), is itself an ordered pair) 13 . not one-to-one, but 
onto 15 . f\ 17 . f~ l (x) = 2 — x if x > 1; f~ l (x) = l/xif0<x<l 

Section 5.1 

1. (i) E (ii) E n F (iii) E + E (iv) E U F (v) E n F (vi) E\F 3. 26 x 
26 x 26 = 17,576 5. 3 x 3 x 3 x 3 = 81 7. (P = pass, F = fail) 

(i) 4; F, PF , PPF , PPP (iii) {F, PE} (iv) {F, PE, PPF} 9. (i) {BBB, BBG , 
BGB, BGG, GBB, GBG, GGB, GGG} (ii) (a) E = [BBB, BBG, BGB, BGG}, 
F = {BBG, BGB, GBB} (b) The oldest child is a boy; the other two are one 
girl and one boy. 11. (i) {22, 21, 12, 20, 11, 02, 10, 01, 00} (ii)(a) E = {21, 20, 
10}, F = {20, 11, 02}, G = {22^20, 11, 02, 00}, E U E = {21, 20, 10, 11, 02}, 
E fl F = {20}, E fl G = {20}, F n G = {22, 00}, (b) E H F: there are exactly 
two heads on quarters and none on nickels, F D G: the tosses are either all heads or 
all tails, (c) No 13. (ii) 9 (iii) {BB, YY, RR} 15. Yes 17. No 19. Yes 21. Yes 
23. (i) No (ii) Yes (iii) No 25. Yes 

Section 5.2 

I . 68 3 . (i) 54 (ii) 41 (iii) 30 5 . (i) 367 (ii) 189 (iii) 871 7 . 25 9 . (i)_3 (iii) 30 

II . 2 13 . (i) 5 (ii) 20 (iii) 18 (iv) 17 15 . (i) 17 (ii) 86 17 . B n C and B HC are 
disjoint sets whose union is C, so | Cj = \B C\C\ + \B DC\. AHB DC and AnBHC 
are disjoint sets whose union is A D C, so | A D C| = |AD5nC| + |An5DC|.So 
|Aucj = |A| + |C|-|Ancj = |A|+(|5nc|+|5nc|)-(|An^nc| + |An5nc|). 
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Section 5.3 

1. m 3.4; 2; 2 5. The map (x, y) —>► (y, x) is one-to-one and onto 7. One possible 
mapping: every nonnegative real x can be written as |_xj + e x , where 0 < f x < 1. 
(c x is the fractional part of x.) Define /(x) = x + 2e x when 0 < e x < and 
fix) = — x + 1 — e x when ^ < e x . 9. (i) max(m, n) < |A U Z?| < m + n 
(ii) 0 < |A n 5| < min(w, n) (iii) max(/?7 — n, 0) < \A\B\ < m (iv) \m — n\ < 

| A + B | < m + n 13. No (in fact, A and B could be disjoint) 

Section 5.4 

1. 120 3.24 5.120 7.40320 9.4 11.20 13. (10) 3 = 720 15. (13) 3 = 1716 

17. 48 19. (i) 3! x 5! = 720 (ii) 4! x 4! = 576 21. (i) 15! (ii) 3! x 4! x 5! x 

6! = 12441600 23. 9!/2!2!2! = 45360 25. 9!/2!2! = 90720 27. 6!/2! = 360 
29. 10!/3!2!2! = 151200 

Section 5.5 

1.56 3.126 5.20 7.35 9.36 11.28 13. C(16,4) = 1820 15. C(6,2) x 
C(12, 3) = 3300 17. C(12, 3) = 792; C(9, 3) = 84 19. C(10, 3) = 120; 

C(9, 2) = 36 21. (i) C(49, 5) (ii) 6 23. 2 x (®) = 40 23. 2 x C(6, 3) = 40 
25. (i) mC(n, 2) + nC(m, 2), or mn(m -\-n — 2)/2 (ii) add mn triangles that include 
A for a total of mn(m + n) 31.84. 33. ( ( "^f 2 ) + 35.364 37.1365 

Section 5.6 

1. x 4 -4x 3 + 6 x 2 -4x + 1 3. 1 - lOz + 40z 2 - 80z 3 + 80z 4 - 32z 5 5. 16x 4 + 

32x 3 y + 24x 2 y 2 + 8 xy 3 + y 4 7. x 4 + 4x 2 + 6 + 4x -2 + x ~ 4 9. x 3 + y 3 + z 3 + 

3x 2 y + 3x 2 z + 3xy 2 + 3xz 2 + 3 y 2 z + 3 yz 2 + 6 xyz 11.2 8 

Section 5. 7 


1.62 3. (i) 21 (ii) 31 5.16 7 . 44; 265 9. (i) 7! (ii) 7!D V 15.55 


Section 6.1 


1 . (i) 5/16 (ii) 1/2 (iii) 1/2 3 . 5/36 5 . 1/2 7 . 1/2 9 . 2/5 11 . (i) 9/19 (ii) 9/19 
(iii) 1/19 (iv) 1/38 (v) 6/19 13 . (i) 1/5 (ii) 4/5 (iii) 13/20 15 . (i) 12,13, 14,15, 22, 
23, 24, 25, 32, 33, 34, 35, 42, 43, 44, 45 (ii) 3/16 (iii) mm 1/2 17 . (i) 2/9 (ii) 7/9 
19 . 0.3; 0.8; 0.2 21 . 0.4; 0.9; 0.9 23 . (ii) (a) 0.25 (b) 0.19 (c) 0.36 25 . (i) 0.4 
(ii) 0.1 27 . (i) 0.625 (ii) 0.35 (iii) 0.125 29 . (i) 0.4 (ii) 0.3 (iii) 0.25 31 . (i) 0.5 
(ii) 0.25 (iii) 0.2 
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Section 6.2 

1 . (i) RR: 6/15; RW: 4/15; WR : 4/15; WW: 1/15 (ii) 8/15 3 . (ii) [(16 • 15 • 14) + 
3 • (16 • 15 • 36)]/(52 -51-50) (approx. 22%) 5 . (ii) 20/56 (iii) 32/56 (iv) 1/8 
7 . (i) 8/81 (ii) 48/81 9 . A 4-2 division is more likely 11 . (^)/2 20 (about 1 chance 
in 200) 13 . 1 - [(19/20) 10 + 10 • (19/20) 9 • (1/20)] (approx. 8.6%) 15 . (i) 8(|) 7 ± 
(about 40%) (ii) 8(|) 7 g + (|) 8 (about 74%) 17 . 1 — (0.9) 4 (just over one chance 
in three) 


Section 6.3 

1. (i) A (ii) 15 3.(i)C(10,3)/C(17,3)=^(ii)(10-3.4)/C(17,3) = ^ 5.^ 
7 . (i) C(10,4)/2 10 (ii) C(10, 2)/2 10 (iii) (1 + C(10, 1) + C(10, 2))/2 10 (approx. 
20.5%, 4.4%, and 5.5%) 9 . (i) 1/6 4 (ii) 25/6 4 (iii) 50/6 4 (approx. 0.1%, 1.9%, and 
3.9%) 11 . 10 x 4 5 /C(52, 5) 13 . C(5, 2) • C(4, 2)/[C(9, 4) - 6] = \ 15 . [(®) + 

(l)]/(3°) = 5 17 - (i) §4 (0 TA 19 - ({h 12 ( a PP rox - 35%) 

Section 6.4 

1 . (i) 17/35 (ii) 11/35 (iii) 1/5 3 . (ii) 13/24 5 . 2/9; 2/11; 2/5; 5/11; 2/5; 5/9 
7 . 25/102; 15/34; 25/51; 25/77; 32/51; 45/77 9 . 1/5; 2/5; 1/4; 5/8 11 . (i) 2/5 
(ii) 1/5 13 . (i) 2/3 (ii) 17/30 (iii) 17/36 15 . 3/4; 3/7 17 . 1; 3/7 19 . 2/3; 2/3 
21 . 1/3; 1/3 23 . 0.9; 0.4 25 . 0.85; 0.35 27 . 0.97; 0.63 29 . (ii) 1/4; 1/4; 1/4; 1/4; 
1/16 (iii) Yes 31 . (ii) 0.6; 0.6; 0.6; 0.6; 0.36 (iii) Yes 33 . (i) 0.24 (ii) 0.54 (iii) 0.6 
(iv) 0.25 35 . (i) 4/9 (ii) Yes 37 . (i) Yes (ii) Yes 41 . (i) No (ii) No 43 . True 
45 . False 47 . True 49 . Impossible 


Section 6.5 


1. 0.5,0.5 3. (i) i; i; f; 0(ii) ±; ±; ff (iii) f; 0; f0 5. (i) 1/97 


5’ 5’ 5’ 10’ 10’ 5 


5’ 5’ 5’ 


(ii) 1/17 7 . (i) 1.1% (ii) g 9.2/3 11 . 3/8; 5/8 13 . 9/16 15.8/15 17 . 7/31 
19.3/143 21 . 3/8; 1/4 


Section 7.1 


1 . No (loops); no 3 . Yes; yes 5 . No (loops); yes 7 . Yes; no 9 . Yes; no 11 . No 
(loop on 1); no 13 . No (loops); no 15 . No (loops); no 17 . (i) 323,303,242 
(ii) 1,124,222 (iii) 2,422,233 (iv) 2,332,332,442 19 . n\ n once, 1 n times 23 . No, 
no 25 . Yes, yes 27 . No, no 29 . Graphical, not valid 


Section 7.2 


1 . Yes; Yes 3. Yes; No 5 . Yes; Yes 7 . Yes; No 9 . No 11 . Yes; Yes 13 . Yes; 
Yes 15 . Yes; No 17.2 19.2 21.4 23.5 27.3 
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Section 7.3 

1. sabct ; sabt ; sadefcbt ; sadefct ; safcbt ; safct 3. No 5. (i) 1; 1 (ii) 1; 2 
(iii) 1; 1 7.1; 2 9. 1; 1 11.1; 1 13. 2; 3 15. 2; 2 

Section 7.4 


1. 4; 3; 7; 6; 5; 4 3. D = 1, 7* = 1 5. D = 2, 7? = 2 7 . D = 3, R = 2 9. D = 2, 
7? = 2 11. £> = 4, 7? = 3 13. £> = 4, 7? = 3 15. D = 5, 7? = 4 17. 7£„ + i; 7£ 5 
missing edges 14, 15, 25; Ks 19. There is a path of weight 12 21. There is a path 
of weight 20 23. There is a path of weight 12 

Section 7.5 

1. There are six trees. 5.9 7.15 9.4 17. MST weights: (i) 27 (ii)(a) 38 (b) 41 
(c) 43 19. Weight 28 21. Weight 64 23. Weight 45 25. Weight 31 

Section 7.7 

1. abcdef , abdcef 3. abcehgfd , abcfdgeh , abdfcegh , abdgfceh , abecfdgh , 
adbcfgeh , adbecfgh , adfcbegh , adgfcbeh 5. abcefhgd , abcfehgd , abcfhegd , 
abcfhgde , abcfhged , abdghfce , abecfhgd , abfcehgd , adbcfhge , adghfbce , 
adghfcbe 7. G: abefcd costs 115; 7f: abcfed costs 115 9. NN: a 117, b 117, 
c 122, d 117, e 117, SE: 117 11. NN: a 112, Z? 113, c 112, d 113, e 116, SE: 113 
13. NN: a 74, 6 76, c 76, d 74, e 76, SE: 76 15. NN: a 105, 6 105, c 105, d 100, 
e 100, SE: 105 17. NN: a 158, b 152, c 158, d 158, e 152, SE: 158 

Section 8.1 


3. (-2,2) 5. (9, 18,3) 7. (4, 1) 9. (-6, 0, 6) 11. (7,0,4) 13.(5,-20,10,15) 

15.-1 17.-5 19.3 21.6 23.15 25.2x4 27.2x2 29.No 31.1x4 33.4x4 
35.2 x 4 37. No 39.1 x 3 41.2 x 3 43. [_ ? 3 2 2 ] 45. f 28 " 1 ] 47. [ 2 ~J] 

49. [ _ 2 4 ~ 2 ] 51. [A] 53. [ 2 ] 55. [ 7 -2 ] 57. No 59. No 61. [ l 3 l ] 63. No 

65. x = 2, y = 1 

Section 8.2 


1. Yes if A is square, otherwise no (G’s not the same size) 3. AT? = [_j *], 
HA = [4 7] No 5. A* = [:^ 2 8 ],fiA = [°: 2 4 ] No 7. A* = [_%“] Yes 


9.[_° 20 T[:2- 2 2 ] h-[LT[:I: 4 4 ] * 

[£&]<*>[&£] 27. First and third 


r 1 - 3 9 i 

r —7 -18 15 1 

1 10-4 

, 6 34 -11 

—3 -4 -1-1 

'—5 -11 -6 -1 


16 12] 
20 24 J’ 
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Section 8.3 

1. x = 3, y = 1, any real z 3. x = 2 — z, y = 1 + z, any real z 5. No solutions 
7. No solutions 9. x = 4, y = 3, z = — 3 11. x = 2 — z, y = 1 + z, any real 
z 13. x = 2, y = 4, z = —2 15. x = —3, y = 2 17. x = ^(5 — 3y), any real 
y 19. No solutions 21. * = 2, y = — 1 23. x = 1 + 5z, y = 1 — 3z, any real z 
25. x = 3, y = 2, z = —2 27. x = |, y = g,z = | 29. x = 3, y = —1, z = — 2 
31. x = y = — i,z = 0 33. x = 1 — 2z + A y = 2 — z + 2t, any real z, any real 
t 35. No solutions 


Section 8.4 


1 . Empty 3. Infinite 5. Singleton 7. ^ °j f J 9. [_f 2 3 7 ] 11. No inverse 



r — 1 ° in 


r 4/7 -1 2/7 n 


r 3/2 0 \ 

r 7 -3 -3-1 

13. 

0 10 

L 1 00 J 

15. 

3/7 0 -2/7 

L -5/7 1 1/7 -1 

17. 

-11/6 \ —5/6 19. 

- 17/6 5/6 -1 

-1 1 0 
L-i 0 1 -1 


inverse 23. No inverse 25. 13; [^2/13 7/13 3 ] 27. — 10; [ 1 / 5 ° 1 / 5 ] 29. 0; No 

inverse 31. (ii)(a) x = 2, y = — 1 (b) x = 1, y = 1 33. No 

Section 8.5 
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Section 9.1 


1. 63 : 1, 3, 7, 9, 21, 63; 64 : 1, 2, 4, 8, 16, 32, 64; 288 : 1, 2, 3, 4, 6, 8, 9, 12, 16, 18, 
32,36,96,288 3.123456,51804 5.257,419 7.10 9.60 11.12 = 3-84-4.60 
13. 120 = 4 - 480 - 1 - 1800 15. 24 = 1 - 144 - 1 • 120 17. 21 = 1 - 861 - 4 - 210 
21. 18 23. 21 25. Say p divides a, p > *Ja. Then a/p < +fa, and it is a factor 
of a 
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Section 9.2 


1. 3 3. 0 5.1 7. 1 9. 3 11. 1 13. 5 15. 4 17. 1 19. 1 21. 1 23. 0 

25. 3 27. 0 29. 3 31. 0 33. 1 35. 1 37. 0 39. 1 41. 1 and 131 are their 
own inverses; 3 _1 = 5, 5 _1 = 3, 9 _1 = 11, ll -1 = 9, 2, 4, 6, 7, 8, 10, and 12 are 
zero-divisors 43.5 45.4 47.7 49.3 51.5 53.11 55.1 57.11 59.0 61.9 

63. 298 (mod 385) 65. 236 (mod 455) 67. 1 + 1 = 0 mod 2; 1 + 1 = 1 in B 2 


Section 9.3 

1. Encrypt always implies secrecy; encode need not 3. Wake up and smell the 
coffee. 5. How many more miles must we march 7. It is a truth universally ac¬ 
knowledged that a single man in possession of a good fortune must be in want of 
a wife (Pride and Prejudice, Jane Austen) 9. Like many of the great generals of 
history Caesar seems to have been lacking in cryptographic subtlety (Cryptography, 
Arnold Beutelspacher) 11. VHQG WURRSV 13. WKH HQG LV QHDU 15. at¬ 
tack 17. retreat 19. AOL TWU OHZ YPZLU 21. ZLCLU RUPNOAZ HYL 
HWWYVH JOPUN 23. Do not pass go 25. bread and circuses 27. Make love not 
war 29. Send in the clowns 31. REIPF FHRCG YXCVJ FQSZZ 33. OUGAB 
PFUAK NINYA CGQZZ 

Section 9.4 

I. EKOFMCEP FQ DKLSCEP WLPV ALP EKOPYMS 3. AKTFMDAO FP 
EKLQDAO VLOR BLO AKTOYMQ 5. THADKRTM DO GHJPRTM UJMN LJM 
THAMXKP 7. EFTYIWEK YL MFGNWEK UGKH RGK EFTKXIN 9. AZURE 

II. AORTA contains repetitions 13. DAYLONG is good 15. PIE is too short 
17. LAZY is good but short 19. (i) TP TO RJFFJG PJ TDDQOPNMPC OCPO MGY 
JKCNMPTJGO JG OCPO EX YTMINMFO (ii) HTGY M EJJDCMG CWKNCOOTJG 
RJNNCOKJGYTGI PJ PSC HJDDJVTGI RTNRQTP (iii) PSNCC EJXO MGY HJQN 
ITNDO MNC PJ OTP MDJGI M ECGRS (iv) KNJUC PSMP MGX PVJ YTMIJGMD 
FMPNTRCO RJFFQPC 21. (i) SR SQ XKIIKJ RK SHHTQRPFRN QNRQ FJA 
KLNPFRSKJQ KJ QNRQ OY ASFGPFIQ (ii) DSJA F OKKHNFJ NWLPNQQSKJ 
XKPPNQLKJASJG RK REN DKHHKVSJG XSPXTSR (iii) REPNN OKYQ FJA DKTP 
GSPHQ FPN RK QSR FHKJG F ONJXE (iv) LPKUN REFR FJY RVK ASFGKJFH 
IFRPSXNQ XKIITRN 23. The story of Fermat’s Last Theorem is inextricably 
linked with the history of mathematics (Fermat's Enigma, Simon Singh) 25. Older 
men declare war but it is the youth who must fight and die (Herbert Hoover) 27. The 
optimist believes everything he reads on the jacket of a new book 


Section 9.5 

1. (i) 3, 7, 9 (ii) 17 3. {/?, q } = {11, 13}, s = 61 5. 72 7. Yes 9. No 11. Yes 

13. Yes 15.10 17.24 19.7 21.2 23.4 25.5 27.8 29.4 31.25 33. (i) 13 
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(ii) a. 52 01 49 09 15 01 09 b. 51 14 51 25 15 15 49 c. 15 33 14 23 d. 25 20 52 20 02 
02 20 12 35. DATA 37. (i) 17 (ii) 13 05 05 20 01 20 14 15 15 14 (iii) MEET AT 
NOON 

Section 9.6 

1. 16 3.15 5.11 7.5 9.2 11.27 13.28 15.14 17.613x647 19.523x541 
21. 461 x 653 23. 467 x 631 25. 877 x 983 

Section 9.7 

1. (ii) 32 (iii) 8 3. 9, 1 
Section 10.1 

1. B wins under both methods 3. C received 22 votes, B wins under plurality but 
there is no winner under majority 5. 37; 30 7. (i) 123 (ii) Smith 52, Jones 36, 
Brown 35 (iii) Smith (iv) There is no winner 9. Smith 11. (i) No winner (ii) Y 

(iii) X (iv) In the Borda count, Y would win 13. 6 ; 13 15. No 17. (i) Chinese (ii) 
Italian (iii) Yes, Italian 19. (i) D\ (ii) B 21. (i) 2? (ii) 2? 23. C; A; yes, A 25. B\ 
yes, B 

Section 10.2 

1. 9; C, B 3. 10; B, E 5.10; 2), A, £ 7.8 ; A, C, A 9. 10; B, E, A 11. (i) 
A appointed (ii) no one appointed (i) A appointed (i) A and F appointed 13. (i) 
B, F, H chosen (ii) 6 votes (i) B, F chosen 15. (i) C, E chosen, runoff between 
B, D, J (ii) 7 votes (i) C, E chosen 17. (i) L chosen, runoff between A, B , D, E, J 
(ii) 8 votes (i) L chosen 19. (i) A, B elected; no other candidate elected (ii) A, B 
elected; runoff between C and D 

Section 10.3 

1. (i) No winner (ii) W wins (iii) Y wins (iv) X wins (v) No winner (vi) W wins 
(vii) Y wins (viii) Z wins 3. (i) A (ii) Yes; if they vote B , C, D, A, then C will 
win (iii) Yes; if they vote C, B, A, D, then C will win 5. B wins initially 7. (i) A 
(ii) C now wins 9. S' 11. (i) C (ii) A (iii) B 13. (ii) No. If they choose any list 
where Z is preferred to X , then Z wins; otherwise X wins 15. (i) C (ii) A (iii) B 
17. A: use B , C, D, A; B: use A, C, D , B ; C: use A, B, D, C; D: use C, B , A, D 

Section 10.4 

1. (i) yes, T (ii) Y (iii) X 3. (i) B (ii) B (iii) D 5. (i) A (ii) yes, C (iii) B (iv) no 
(v) no (vi) yes 7. (i) A (ii) if they exchange B and C, B now wins 9. (i) A beats C 
12-5 (ii) B now wins (beating A 9-8) 11. A, C, D, B 


Index 


absolute majority, 339 
absolute value, 7, 8 
absorption law, 70, 74 
acyclic graph, 244 
additive cipher, 314-316 
adjacency, 79, 217 
adjacency matrix, 291-296 
algorithm, 239 
amendments, 364 
AND gate, 87 
antireflexive, 215 
antisymmetric, 96, 102 
approval rating, 352 
approval table, 353 
approval voting, 352-359 
arrangement, 133-139, 143, 158 
arrangements with repetition, 135 
Arrow’s impossibility theorem, 374 
associative law, 68, 95, 106, 264 
associativity, 41 
asymmetric, 96, 103 
atransitive, 96, 103 
augmented matrix, 278, 285 
axiom of choice, 131 

base, 15-20 

Bayes’ formula, 203-214 
Bernoulli trial, 179-183 
biconditional, 34 
binary arithmetic, 24 
binary digit, 141 
binary numbers, 15-20 
binary operation, 67 


binary relation, 93-111, 215 
binomial experiment, 179 
binomial theorem, 151-155 
bipartite, 218 
bipartite graph, 218 
birthday coincidence, 182 
bit, 25, 141 

Boolean algebra, 67-92 
Boolean expression, 73-79 
Boolean form, 73-79 
Borda count, 345-349 
boundedness law, 70 
box diagram, 207-214 
branch, 116 

bridge, 233-236, 245, 249 
bridges of Konigsberg, 223-226 

Caesar cipher, 313, 314 
cancellation law, 274 
carry digit, 26 

cartesian product, 54-57, 60 

ceiling, 7, 8 

change of base, 15-20 

Chinese remainder theorem, 308, 333 

choose, 140-151 

ciphertext, 310 

circuit, 232 

circular relation, 104 

closed walk, 232 

closure, 67 

codomain, 106 

column, 79 

column vector, 266 


W.D. Wallis, A Beginner’s Guide to Discrete Mathematics , 
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Index 


combinatorial proof, 150 
commutative law, 68, 74, 264, 271 
commutativity, 41 
commuting matrices, 271 
comparability, 102 
complement, 25, 68, 217 
complement law, 68 
complement of graph, 218 
complement of set, 39 
complete bipartite graph, 218 
complete graph, 217 
component, 218, 222 
composition, 94, 106 
compound experiment, 116 
conditional, 34 

conditional probability, 191-203 

conditional probability formula, 193 

Condorcet method, 343, 344 

Condorcet winner condition, 371 

congruence, 62, 100 

conjunction, 32, 292 

connected, 222, 233, 245 

connected graph, 218 

connectedness, 233-236 

connectivity, 233-236 

consistent equations, 285 

containment, 103 

coprime, 4 

countable set, 131 

countably infinite, 131 

covering of relations, 97 

cryptanalysis, 311 

cryptography, 310-338 

cryptology, 311 

cryptosystem, 326 

cubic graph, 219 

cut-edge, 233-236 

outpoint, 233-236 

CWC, 371 

cycle, 103, 232-236 

de Morgan’s laws, 43, 71 
decryption, 310 
defining set, 106 
degree, 218-222, 234, 235, 245 
derangement, 158-160, 188 
derangement number, 159 
determinant, 288 
diameter of graph, 237 


dictator, 374 
digital circuit, 86-92 
digraph, 215 
dimension of vector, 263 
directed graph, 215 
disconnected graph, 218 
disjoint sets, 40 
disjunction, 32 
disjunctive form, 73-79 
disjunctive normal form, 76-79 
distance, 236-238, 241 
distinguishable elements, 136 
distributive law, 41, 68, 70 
domain, 106 
dot product, 265 
double precision, 27 

eccentricity, 237 
edge, 215 

edge-connectivity, 233-236 
eight-bit number, 25-30 
election 

primary, 340 
elections, 339-378 
electorate, 339 
ellipsis, 3 
empty graph, 218 
empty set, 39 
encoding, 310 
encryption, 310 
encryption exponent, 333 
endpoint, 217 
equivalence class, 101-105 
equivalence relation, 100-105, 129, 305 
Euclidean algorithm, 299-302, 323 
Euler circuit, 225 
Euler walk, 223-231 
Eulerization, 228 
Eulerization number, 228 
Euler’s theorem, 225-227 
even vertex, 225 
event, 113-121 
experiment, 113 
exponent, 5-7, 20 
exponent part, 27 
extended precision, 27 

factor, 158 
factorial, 133 
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Fermat factorization, 337, 338 

inverse matrix, 272, 286 

Fibonacci numbers, 62 

inverse relation, 95, 108 

finite set, 130 

inverter, 87 

fixed point, 159 

invertible, 273 

floating point, 20-24 

invertible matrix, 286 

floating point arithmetic, 27-30 

involution law, 71 

floor, 7, 8 

irreflexive, 96, 102, 103 

format, 27 

isolated vertex, 218 

full adder, 89, 90 
function, 105-111 

Karnaugh map, 79-86 

fundamental product, 74, 79 

key, 310 

gate, 87 

keyword, 317-322 

Konigsberg bridges, 223-226 

generalized Hare method, 350-359 
generation, 116 

least common multiple, 323 

graph, 215-262, 294-296 

length, 232 

infinite, 235 

length of vector, 263 

weighted, 238 

letter frequency, 318-321 

graphical collection, 219 

literal, 73 

greatest common divisor, 4, 299-302 

logarithm, 6, 7 

half adder, 88 

logic circuit, 87 
logical equivalence, 34 

Hamiltonian cycle, 252-262 

looped digraph, 215 

Hamiltonian path, 252, 254 

looped graph, 215 

Hare method, 342, 343, 345-359 
hexadecimal number, 15-20, 28-30 

majority 

idempotence, 41 

absolute, 339 
simple, 340 

idempotent law, 70, 74 

mantissa, 20, 27 

identity law, 68 

mapping, 106 

identity matrix, 271 

mathematical induction, 57-65 

IEEE754 format, 27-30 

matrices and equations, 277 

IIA, 372 

matrix, 263-296 

image, 106 

minimal form, 75, 83 

incidence matrix, 294 

minimal spanning tree, 246-251 

inclusion, 74 

minimum degree, 234 

inclusion and exclusion, 125, 126 

modular arithmetic, 303-307 

inconsistent, 278 

monotonicity, 373 

inconsistent equations, 285 

monotonicity criterion, 373 

independence, 116, 195 

Monty Hall problem, 209-211 

independence of irrelevant alternatives, 372 

multigraph, 216, 222, 232 

indistinguishable elements, 136 

multiple edge, 222 

infinite graph, 215 

multiple edges, 216 

infinite set, 130 

multiplication principle, 115, 134 

insincere ballot, 362 

multiplicity, 222 

integers, 2 

mutually exclusive, 115, 196 

intersection, 39, 94, 156 
interval, 4 

natural numbers, 3 

inverse function, 108-111, 130 

nearest neighbor algorithm, 258 
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negation, 32 

non-singular, 273 

non-uniform experiment, 168 

non-uniform probabilities, 169 

NOT gate, 87 

null set, 39 

nullity law, 70, 74 

odd vertex, 225 
one-to-one, 107, 129-132 
one-to-one correspondence, 129-132 
one’s complement number, 25 
onto, 107, 129-132 
OR gate, 87 
order relation, 102-105 
outcome, 166, 178, 192 
overflow, 26 

Pareto condition, 374 

partial order, 102 

partition, 40, 101 

Pascal’s triangle, 144-151, 153 

path, 232-244 

pendant edge, 219 

pendant vertex, 218 

permutation, 133, 159 

Petersen graph, 235 

pigeonhole principle, 160-162 

plaintext, 310 

plane, 265 

plurality, 340 

plurality runoff method, 340 
poll assumption, 370, 371 
polls, 370-378 
positive integers, 3 
power set, 39 

predecessor-successor relation, 103 
preference profile, 340 
primary election, 340 
prime implicant, 75-79 
prime number, 4, 131, 297-299 
Prim’s algorithm, 247 
Principle of duality, 69 
principle of inclusion and exclusion, 
156-158 

principle of mathematical induction, 131 
probability, 185 
probability distribution, 166 
product of matrices, 267-277 


profile, 340 
proper subgraph, 217 
proper subset, 131 
proposition, 31 

public key cryptosystem, 326, 327 

radius of graph, 237 
range, 106 
rational numbers, 3 
real numbers, 4 
rectangular array, 266 
reduced key phrase, 318 
reduced keyword, 318 
reduced row echelon form, 284 
reflexive, 96, 100, 102, 129 
regular graph, 219, 222 
relation, 93-111, 291-296 
relative complement, 39 
relative difference, 39 
relatively prime, 4 
residue class, 305 
road networks, 222, 233, 252 
rounding, 21 
row, 79 

row vector, 266 
RSA cryptosystem, 323-338 
rule of sum, 122, 143 
runoff method, 340 

scalar multiplication, 264 

scalar product, 265, 267 

scientific notation, 20-24 

scytale, 311-313 

selection, 140-151, 185 

sequence, 10, 38, 133 

sequential pairwise voting, 361-369 

sequential voting, 340-343 

set, 1-4,38-57, 115 

set-theoretic difference, 39 

shape of matrix, 266 

shortest path, 238-243 

sigma notation, 9-14 

sign bit, 27 

signature system, 329 

simple, 215 

simple majority, 340 

simple walk, 232 

single precision, 27 
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single transferable vote, 353-359 
basic, 353 
dynamic, 353 
singular, 273 
singular matrix, 286 
size of matrix, 266 
skew, 96 

sorted edges algorithm, 259 
spanning tree, 246-251 
splitter, 88 

square matrix, 272, 286 
star, 218, 221,244 
stochastic process, 177-185 
strategic voting, 362 
string, 141 
strong induction, 58 
strong partial order, 102 
subgraph, 217, 233 
subset, 39, 131 
substitution cipher, 317-322 
sum of matrices, 267 
summation formulae, 153, 154 
superset, 39 

symmetric, 96, 100, 129, 215 
symmetric difference, 52-57 
symmetric matrix, 268 
symmetry, 144 

systems of linear of equations, 277-291 

tautology, 34 

term, 74 

total order, 102 

transitive, 96, 100, 102, 129 

transpose, 268 

Traveling Salesman Problem, 257-262 
traversability, 222-231 
tree, 243-251 


tree diagram, 115-121, 178, 192, 196, 205 
truth table, 31-38, 45, 46, 54 
truth value, 31 

two’s complement number, 25-30 

unary operation, 67 
uncountable, 131 
uniform experiment, 166 
uniform probabilities, 166 
union, 39, 94, 121-128, 156 
unity, 68 
universal set, 39 

valency, 218-222 
vector, 263-270 
vector addition, 264 

Venn diagram, 46-52, 54, 123-128, 170 
vertex, 116, 215, 225 
Vigenere cipher, 314 
voting, 339-378 
Condorcet, 343 
majority, 339 
plurality, 340 
sequential, 340-343 
sequential pairwise, 361-369 

walk, 226, 232-236 
weak induction, 58 
weak partial order, 102 
weight, 238, 246 
weighted distance, 238 
weighted graph, 238 
wheel, 221 

Whitney’s Theorem, 234 
zero, 68 

zero matrix, 271 
zero-divisor, 305-307 


